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RECONSTRUCTION AND INTERPOLATION OF MANIFOLDS I: 
THE GEOMETRIC WHITNEY PROBLEM 

CHARLES FEFFERMAN, SERGEI IVANOV, YAROSLAV KURYLEV, 

MATTI LASS AS, HARIHARAN NARAYANAN 


Abstract. We study the geometric Whitney problem on how a Riemannian 
manifold ( M,g ) can be constructed to approximate a metric space ( X. dx ). 
This problem is closely related to manifold interpolation (or manifold learning) 
where a smooth n-dimensional surface S C R m , m > n needs to be constructed 
to approximate a point cloud in R m . These questions are encountered in differ¬ 
ential geometry, machine learning, and in many inverse problems encountered 
in applications. The determination of a Riemannian manifold includes the 
construction of its topology, differentiable structure, and metric. 

We give constructive solutions to the above problems. Moreover, we char¬ 
acterize the metric spaces that can be approximated, by Riemannian manifolds 
with bounded geometry: We give sufficient conditions to ensure that a met¬ 
ric space can be approximated, in the Gromov-Hausdorff or quasi-isometric 
sense, by a Riemannian manifold of a fixed dimension and with bounded di¬ 
ameter, sectional curvature, and injectivity radius. Also, we show that similar 
conditions, with modified values of parameters, are necessary. 

Moreover, we characterise the subsets of Euclidean spaces that can be ap¬ 
proximated in the Hausdorff metric by submanifolds of a fixed dimension and 
with bounded principal curvatures and normal injectivity radius. 

The above interpolation problems are also studied for unbounded metric 
sets and manifolds. The results for Riemannian manifolds are based on a 
generalisation of the Whitney embedding construction where approximative 
coordinate charts are embedded in R m and interpolated to a smooth surface. 
We also give algorithms that solve the problems for finite data. 


Keywords: Whitney’s extension problem, Riemannian manifolds, machine learn¬ 
ing, inverse problems. 
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1. Introduction and the main results 

1.1. Geometrization of Whitney’s extension problem. In this paper we de¬ 
velop a geometric version of Whitney’s extension problem. Let / : K — > R be a 
function defined on a given (arbitrary) set K C R", and let m > 1 be a given 
integer. The classical Whitney problem is the question whether / extends to a 
function F £ C m (R") and if such an F exists, what is the optimal C m norm of the 
extension. Furthermore, one is interested in the questions if the derivatives of F , 
up to order m, at a given point can be estimated, or if one can construct extension 
F so that it depends linearly on /. 

These questions go back to the work of H. Whitney [ZS EH [75] in 1934. In 
the decades since Whitney’s seminal work, fundamental progress was made by G. 
Glaeser [32] , Y. Brudnyi and P. Shvartsman [131 [12 El EH HU E] and [53] El [71] , 
and E. Bierstone-P. Milman-W. Pawluski [7]. (See also N. Zobin [ST] [53] for the 
solution of a closely related problem.) 

The above questions have been answered in the last few years, thanks to work of 
E. Bierstone, Y. Brudnyi, C. Fefferman, P. Milrnan, W. Pawluski, P. Shvartsman 
and others, (see [7] E] E] E] El El El El El El El E]-) Along the way, the 
analogous problems with C m (R n ) replaced by C ,m,aj (R rl ), the space of functions 
whose m th derivatives have a given modulus of continuity to, (see (32 S3]), were 
also solved. 

The solution of Whitney’s problems has led to a new algorithm for interpolation 
of data, due to C. Fefferman and B. Klartag [32 [36], where the authors show how 
to compute efficiently an interpolant F(x), whose C m norm lies within a factor C 
of least possible, where C is a constant depending only on m and n. 

In recent years, the focus of attention in this problem has moved to the direction 
when the measurements / : K —> R on the function / are given with errors bounded 
by e > 0. Then, the task is to find a function F : M" —>■ R such that sup x&K \F(x)~ 
f(x )| < e. Since the solution is not unique, one wants to find the extension that 
has the optimal norm in C' 7ra (R"), see e.g. [33 S3- Finding F can be considered 
as the task of Ending a graph T(F) = {( x,F(x )) : x £ 1"} C R" +1 of a function 
in C m (R”) that approximates the points {(x,f(x)) : x £ K}. To formulate the 
above problems in geometric (i.e. coordinates invariant) terms, instead of a graph 
set r (F), we aim to construct a general surface or a Riemannian manifold that 
approximates the given data. Also, instead of the C' m (R")-norms, we will measure 
the optimality of the solution in terms of invariant bounds for the curvature and 
the injectivity radius. 

In this paper we consider the following two geometric Whitney problems: 

A. Let E be a separable Hilbert space, e.g. R w , and assume that we are given 
a set X C E. When can one construct a smooth n-dimensional surface 
M C E that approximates X with given bounds for the geometry of M 
and the Hausdorff distance between M and XI How can the surface M 
can be efficiently constructed when X is given? 

B. Let ( X , dx) be a metric space. When there exists a Riemannian manifold 
(M, g) that has given bounds for geometry and approximates well XI How 
can the manifold (M, g) be constructed when X is given? 
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In Question B, by ‘approximation’ we mean Gromov-Hausdorff or quasi-isometric 
approximation, see definitions in Def. 11.31 and Section 12.11 

We answer the Question A in Theorem [2] below, by showing that if X C E is 
locally (i.e., at a certain small scale) close to affine n-dimensional planes, see Def. 
fL9l there is a surface M C E such that the Hausdorff distance of X and M is 
small and the second fundamental form and the normal injectivity radius of M are 
bounded. 

The answer to the Question B is given in Theorem |7] below. Roughly speaking, 
it asserts that the following natural conditions on X are necessary and sufficient: 
locally, X should be close to R”, and globally, the metric of X should be almost 
intrinsic. 

The conditions in Theorem[l]are optimal, up to multiplying the obtained bounds 
by a constant factor depending on n. Theorem [T] gives sufficient conditions for 
metric spaces that approximate smooth manifolds. In Corollary 11.41 we show that 
similar conditions, with modified values of parameters, are necessary. 

The result of Theorem [2] is optimal, up to multiplication the obtained bounds 
by a constant factor depending on n. 

The proofs of Theorems |T| and [2] are constructive and give raise to algorithms 
when X is a finite set. Moreover, we give algorithms that verify if a finite data set 
X satisfies the characterisations given in Theorems [l] and [2] 

Next we formulate the definitions needed to state the results rigorously. 

Notation. For a metric space X and sets 4,BcI, we denote by d^{A,B), or 
just by dn(A, B), the Hausdorff distance between A and B in X. 

By cIgh{X,Y) we denote the Gromov-Hausdorff (GH) distance between metric 
spaces X and Y. For the reader’s convenience, we collect definitions and elemen¬ 
tary facts about the GH distance in section 12.11 For more detailed account of the 
topic, see e.g. [2fll IHTl [68] . In most cases we work with pointed GH distance between 
pointed metric spaces (X,xo) and (Y,y o), where xo € X and yo £ Y are distin¬ 
guished points. For the definition of pointed GH distance, see HU §1.2 in Ch. 10]) 
or section o 

For a metric space X, x £ X and r > 0, we denote by Z?; Y (x) or B r {x) the 
ball of radius r centered at x. For X = R™, we use notation B™(x) = Bf (x) and 
= B™( 0). For a set Ac X and r > 0, we denote by U*(A) or U r (A) the metric 
neighborhood of A of radius r, that is the set points within distance r from A. 

When speaking about GH distance between metric balls B*{x) and B ]T (y), we 
always mean the pointed GH distance where the centers x and y are distinguished 
points of the balls. We abuse notation and write dcH^B* (x), B} (y)) to denote 
this pointed GH distance. 

For a Riemannian manifold M, we denote by Sec m its sectional curvature and 
by inj M its injectivity radius. 

Small metric balls in a Riemannian manifold are GH close to Euclidean balls. 
More precisely, let M be a Riemannian n-manifold with | SecM | < K and inj M > 
2po where K and po are positive constants, and 0 < r < min{Rf -1 / 2 , po}. Then 
the metric ball B^ r (x) in M and the Euclidean ball = J3® (0) satisfy 

(1.1) d GH (B™(x),B?)<Kr 3 . 

For a proof of this estimate, see section [6] 

If M is a submanifold of R^, one can write a similar estimate for the Hausdorff 
distance in R w . Namely if the principal curvatures of M are bounded by n > 0, 
then M deviates from its tangent space by at most \nr 2 within a ball of radius r. 
Thus the Hausdorff distance between r-ball B^ 1 {x) in M and the ball Bj xM {x) = 
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B x (x) n T X M of the affine tangent space of M at x satisfy 

(1.2) d H (B™(x),Bj* M (x)) < \nr 2 . 

Note the different order of the above estimates for the intrinsic distances ED and 
the extrinsic distances ED- 

With (11.11) in mind, we give the following definition. 

Definition 1.1. Let A be a metric space, r > 8 > 0, n £ N. We say that X is 
8-close to R n at scale r if, for any x £ X, 

(1.3) d GH (B?(x),B?)<5. 

Condition ED can be effectively verified, up to a constant factor, see Algorithm 
GHDist below. The condition can be also formulated for finite subsets: If sequences 
(yj)jLi c B™ and C B*(x) are f-nets such that \dRn(yj,y k )-dx(xj,Xk)\ < 

| for all j, k = 1,2,..., N, then (II.3D is valid by Q3 Prop. 7.3.16 and Cor. 7.3.28]. 
On the other hand, if X is ^r-close to R" at scale r, then such |-nets exists. 

In a Riemannian manifold, large-scale distances are determined by small-scale 
ones through the lengths of paths. However Definition 11.11 does not impose any 
restrictions on distances larger that 2 r in X. To rectify this, we need to make the 
metric ‘almost intrinsic’ as explained below. 

Definition 1.2. Let X = (X, d) be a metric space and <5 > 0. A 8-chain in A is a 
finite sequence xi, X 2 , ■ ■ ■, Xn £ X such that d(xi,Xi+i) < 8 for all 1 < i < N — 1. 
A sequence Xi, X 2 , ■ ■ ■, X ]v £ A is said to be 8-straight if 

(1.4) d(xi,Xj) + d{xj,x k ) < d(xi,Xk) + 8 

for all 1 < * < j < k < N. We say that A is 8-intrinsic if for every pair of points 
x, y £ A there is a (5-straight (5-chain x±,... ,Xn with x\ = x and Xn = V- 

Clearly every Riemannian manifold (more generally, every length space) is 8- 
intrinsic for any <5 > 0. Moreover, if A lies within GH distance 8 from a length 
space, then A is CJ-intrinsic. In fact, this property characterizes <5-intrinsic metrics, 
see Lemma El 

If a metric space A = (A, d) is (5-close to 1" at scale r > 8 (see Definition 
ED. then one can change ‘large’ distances in A so that the resulting metric is 
C(5-intrinsic and coincides with d within balls of radius r. The new distances are 
measured along ‘discrete shortest paths’ in A. For details, see ED and Lemma 
12.31 in section 12.21 

In order to conveniently compare metric spaces at both small scale and large 
scale, we need the notion of quasi-isometry. 

Definition 1.3. Let A, Y be metric spaces, e > 0 and A > 1. A (not necessarily 
continuous) map /: A — > Y is said to be a (A, e)- quasi-isometry if the image /(A) 
is an e-net in Y and 

(1.5) X~ 1 d x (x,y) - e < d Y (f(x),f(y)) < \d x {x,y) +e 

for all x,x' £ A, where dx and dy denote the distances in A and Y, resp. 

Unlike the use of quasi-isometries in e.g. geometric group theory, in this paper 
we consider quasi-isometries with parameters e ~ 0 and A « 1. The quasi-isometry 
relation is almost symmetric: if there is a (A, e)-quasi-isometry from A to Y, then 
there exists a (A, CAe)-quasi-isometry from Y to A, where C is a universal constant. 
We say that metric spaces A and Y are (A, e) -quasi-isometric if there is a (A, e)- 
quasi-isometry in either direction. 
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The existence of (A, e)-quasi-isometry f: X Y implies that 

(1.6) d GH {X,Y) < 2(A-l)diam(A) + 2e. 

If X and Y are intrinsic (or e-intrinsic), a similar estimate holds for metric balls: 

(1.7) dGH(B%(x),B*(f(x))) <C(\-1)R + Ce 
for every x £ X and R > 0. See section f?Tl for the proof. 

Now we formulate our main result. 

Theorem 1. For every n £ N there exist o\ = o\ (n) > 0 and C = C(n) > 0 such 
that the following holds. Let X be a metric space, r > 0 and 

( 1 . 8 ) 0 < 6 < air. 

Suppose that X is 6-intrinsic and S-close to R" at scale r, see Definitions \l.l\ and 
m Then there exists a complete n-dimensional Riemannian manifold M such that 

(1) X and M are (1 + (75r _1 ,C5)-quasi-isometric and therefore 

(1.9) d G H{X, M) < C5r~ l diam(A). 

(2) The sectional curvature Sec m of M satisfies | SecM | < C5r~ 3 . 

(3) The injectivity radius of M is bounded below by r/2. 

The estimate m follows from the existence of a (1 + CSr 1 , C'<5)-quasi-isometry 
from X to M due to (11.61) and the fact that diam(X) > r. The proof of Theorem[T| 
is given in Section [I] 

The quasi-isometry parameters and sectional curvature bound in Theorem [T| are 
optimal up to constant factors depending only on n, see Remark 14.201 

Furthermore, Theorem [l] gives a characterisation result for metric spaces that 
GH approximate smooth manifolds with certain geometric bounds. The precise 
formulation is the following. 

Let A4(n, K, io, D) denote the class of ?r-dimensional compact Riemannian man¬ 
ifolds M satisfying | SecM | < K, inj M > i 0 , and diam(M) < D. Denote by 
M e (n, K,i 0l D) the class of metric spaces X such that d G n(X, M) < e for some 
M £ M{n, K, i 0 , D). Also, let X{n, <5, r, D) denote the class of metric spaces X that 
are <5-intrinsic and (5-close to R n at scale r, and satisfy diarn(A) < D. Theorem [T| 
has the following corollary that concerns neighbourhoods of smooth manifolds and 
the class of metric spaces that satisfy a weak 5-flatness condition in the scale of 
injectivity radius and a strong 5-flatness condition in a small scale r. 

Corollary 1.4. For every n £ N there exist 02 = 02 (n) > 0 and C = C(n ) > 0 
such that the following holds. Let K,io,D > 0 and assume that io < \/a^/K. Let 
5o = KPq, 0 < 5 < 5o, and r = (S/K) 3 . Let X be the class of metric spaces defined 
by 

X := X{n, 5, r, D ) D X(n, So, io, D). 

Then 

(1.10) M Sl (n, K/2, 2i 0 , D — S) C X C M e2 (n, CK, i 0 /4, D) 
where e 1 =5/6 and £2 = CDK l / 3 5 2 / 3 . 

The optimal values of eq and £2 in Corollary 11.41 remains an open question. The 
proof of Corollary 11.41 is given at the end of section [I] It is based on Theorem [I] 
and Proposition 11.71 below. 

In Corollary 11.41 the first inclusion in (11.101) means that X £ X is a nec¬ 
essary condition that a metric space X approximates a smooth manifold M £ 
M(n,K/2,2io,D — 5) with accuracy e\. Likewise, the second inclusion in (11.101) 
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implies that X £ X is a sufficient condition that a metric space X approximates a 
smooth manifold M € M(n,CK,io/4, D) with accuracy £ 2 - 

We note that an algorithm based on Theorem Q] that also summarises some of 
the main objects used in the proof of the theorem, is given in Section [5j see also 
Fig. 14.21 

In the proof of Theorem [T] M is constructed as a submanifold of a separable 
Hilbert space E, which is either with a large N (in case when X is bounded) 
or t 2 endowed with the the standard || • || £2 norm. However the Riemannian metric 
on M is different from the one inherited from E. 

Here is the idea of the proof of Theorem [Q Since the r-balls in X are GH close 
to the Euclidean ball £?", they admit nice maps (2<5-isometries) to B™. These maps 
can be used as a kind of coordinate charts for X, allowing us to argue about X as 
if it were a manifold. In particular, we can mimic the proof of Whitney Embedding 
Theorem (on classical Whitney embeddings, see [79; 180]). If X were a manifold, 
this would give us a diffeomorphic submanifold of a higher-dimensional Euclidean 
space E. In our case we get a set E C E which is a Hausdorff approximation of a 
submanifold M C E. In order to prove this, we use Theorem [2] (see subsection 11.31 
below) which characterizes sets approximable by (nice) submanifolds. We empha¬ 
size that the resulting submanifold M C E is the image of a Whitney embedding 
but not of a Nash isometric embedding [54] tSS • As the last step of the construction 
(see section m , we construct a Riemannian metric g on M so that a natural map 
from X to (M, g ) is almost isometric at scale r. The construction is explicit and can 
be performed in an algorithmic manner, see section [5] Then, with the assumption 
that X is ^-intrinsic, it is not hard to show that X and (M, g) are quasi-isometric 
with small quasi-isometry constants. 

Convention. Here and later we fix the notation n for the dimension of a (submani¬ 
fold in question. Throughout the paper we denote by c, C , C\, etc., various con¬ 
stants depending only on n and, when dealing with derivative estimates, on the 
order of the derivative involved. To indicate dependence on other parameters, we 
use notation like C(M 1 k ) or Cm,u for numbers depending on manifold M and num¬ 
ber k. The same letter C can be used to denote different constants, even within 
one formula. 

1.2. Manifold reconstruction and inverse problems. Theorem [l] and Corol¬ 
lary 11.41 give quantitative estimates on how one can use discrete metric spaces as 
models of Riemannian manifolds, for example for the purposes of numerical anal¬ 
ysis. With this approach, a data set representing a Riemannian manifold is just a 
matrix of distances between points of some <5-net. Naturally, the distances can be 
measured with some error. In fact, only ‘small scale’ distances need to be known, 
see Corollary II.81 below. 

The statement of Theorem [T] provides a verifiable criterion to tell whether a given 
data set approximates any Riemannian manifold (with certain bounds for curvature 
and injectivity radius). See section f2.31 for an explicit algorithm. 

The proof of Theorem |T] is constructive. It provides an algorithm, although a 
rather complicated one, to construct a Riemannian manifold approximated by a 
given discrete metric space X. See section [5] for an outline of the algorithm. 

Next we formulate results that describe properties of the manifold M constructed 
from data X that approximates some smooth manifold M and discuss how this 
result is used in inverse problems. 


1.2.1. Reconstructions with data that approximate a smooth manifold. When deal¬ 
ing with inverse problems, it is assumed that the data set X comes from some 
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unknown Riemannian manifold M, and moreover some a priori bounds on the ge¬ 
ometry of this manifold are given. Applying Theorem |T] to this data set yields 
another manifold M which is (1 + CJr -1 , C5)-quasi-isometric to M. One naturally 
asks what information about the original manifold M can be recovered. An answer 
is given by the following proposition. 

Proposition 1.5 (cf. Theorem 8.19 in [43]). There exist op = cro(n) > 0 and 
C = C{n ) > 0 such that the following holds. Let M and M be complete Riemannian 
n-manifolds with | Sec m | < K and | Sec^y | < K, where K > 0. 

Let 0 < a < (Jo and assume that M and M are (1 + er, or)-quasi-isometric, where 
r < min{(f7/A') 1 /Vnj M ,inj^}. 

Then M and M are diffeomorphic. Moreover there exists a bi-Lipschitz diffeo- 
morphism between M and M with bi-Lipschitz constant bounded by 1 + Ca. 

We do not prove Proposition 11.51 because it is essentially the same as Theorem 
8.19 in [ 351 except that the approximation is quasi-isometric rather than GH. To 
prove Proposition 11.51 one can apply the same arguments as in [43[ 8.19] using 
coordinate neighborhoods of size r. The estimates are not given explicitly in [43] 
but they follow from the argument. These results can be regarded as quantitative 
versions of Cheeger’s Finiteness Theorem [55] . see [SI1 Ch. 10] and [50] for different 
proofs. 

Remark 1.6. Using results of [I] one can show that M and M in Proposition ll.5l are 
close to each other in C 1,a topology. However we do not know explicit estimates in 
this case. 

1.2.2. An improved estimate for the injectivity radius. The injectivity radius esti¬ 
mate provided by Theorem [T] is not good enough in the context of manifold recon¬ 
struction. Indeed, in order to obtain a good approximation one has to begin with a 
small r. (Recall that for Theorem Q] to work, S should be of order ATr 3 where K is 
the curvature bound.) However Theorem [T| guarantees only a lower bound of order 
r for inj M . so a priori one could end up with an approximating manifold M with a 
very small injectivity radius. In order to rectify this we need the following result. 

Proposition 1.7. There exists C = C{n ) > 0 such that the following holds. 
Let K > 0 and let M , M be complete n-dimensional Riemannian manifolds with 
I Secjvf | < K and | Sec^y | < K. 

1. Let x £ M, x £ M, and 0 < p < min{inj^y('r), -fj=}. Then 

(1.11) > p - C • d GH (B^ (x), Bf (x)). 

2. Suppose that M and M are (1 + i5r _1 , 5)-quasi-isometric where 5 > 0 and 

( 1 . 12 ) 0 < r < minjmj^z), ^ 7 =}. 

Then 

(1.13) inj M > (1 - CSr- 1 ) min{inj^, -fe}. 

The situation described in the second part of Proposition 11.71 occurs when M 
and M are two manifolds approximating the same metric space X as in Theorem [T] 
or when M is a reconstruction of M as in Corollary 11.81 below. The proof of 
Proposition 1 1. 71 is given in section [6] 
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1.2.3. An approximation result with only one parameter. We summarize the man¬ 
ifold reconstruction features of Theorem [1] in the following corollary where all ap¬ 
proximations, errors in data, as well as the errors in the reconstruction are given in 
terms of a single parameter 6. Essentially, the corollary tells that a manifold N can 
be approximately reconstructed from a i5-net X of N and the information about 
local distances between points of X containing small errors. This type of results 
are useful e.g. in inverse problems discussed below. 


Corollary 1.8. Let K > 0, n € Z + and N be a compact n-dimensional manifold 
with sectional curvature bounded by | SeCjv | < K. There exists Sq = 8o(n,K) such 
that if 0 < 5 < So then the following holds: 

Let r = ( S/K j 1 / 3 and suppose that the injectivity radius injjy of N satisfies 
injjY > 2r. Also, let X = {xj : j = 1,2,..., J} C N be a 8-net of N and 
M+ U {0} be a function that satisfies for all x, y £ X 


(1.14) 


\d(x,y) - d N (x,y)\ < 5, ifd N (x,y)<r , 


and 

d(x,y)>r-5 , ifd N (x,y)>r. 

Given the set X and the function d, one can effectively construct a compact n- 
dimensional Riemannian manifold (M, g ) such that: 

(1) There is a diffeomorphism F : M —X N satisfying 


(1.15) 


l_ < d N (F(x),F(y)) 
L ~ d M (x,y) 


for all x, y £ M, 


where L = 1 + CK 1 / 3 8 2 / 3 . 

(2) There is C\ = C\(n ) > 0 such that the sectional curvature Sec m of M 
satisfies | SecM | < C\K. 

(3) The injectivity radius inj M of M satisfies 


inj M > min{(CiA') 1/2 , (1 - CK 1/3 8 2/3 ) inj^}. 


The proof of Corollary 11.81 is given in the end of Section [5] 

We call the function d:IxI-> R + U {0}, defined on the 5-net X and satisfy¬ 
ing the assumptions of Corollary 11.81 an approximate local distance function with 
accuracy 8. Many inverse problems can be reduced to a setting where one can deter¬ 
mine the distance function div(xj,Xk), with measurement errors eyfc, in a discrete 
set {xj}j e j C N. Thus, if the set {xj}j e j is (5-net in N, the errors ej t k satisfy con¬ 
ditions (11.141) , and 8 is small enough, then the diffeomorphism type of the manifold 
can be uniquely determined by Corollary 11.81 Moreover, the bi-Lipschitz condition 
(11.151) means that also the distance function can be determined with small errors. 
We emphasize that in (11.141) one needs to approximately know only the distances 
smaller than r = (S/K) 1 / 3 . The larger distances can be computed as in (22). 


1.2.4. Manifold reconstructions in imaging and inverse problems. Recently, geo¬ 
metric models have became an area of focus of research in inverse problems. As 
an example of such problems, one may consider an object with a variable speed of 
wave propagation. The travel time of a wave between two points defines a natural 
non-Euclidean distance between the points. This is called the travel time metric 
and it corresponds to the distance function of a Riemannian metric. In many topi¬ 
cal inverse problem the task is to determine the Riemannian metric inside an object 
from external measurements, see e.g. ISBElETlEgiTgiBi. These problems are 
the idealizations of practical imaging tasks encountered in medical imaging or in 
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Earth sciences. Also, the relation of discrete and continuous models for these prob¬ 
lems is an active topic of research, see e.g. 013 Hi \M- In these results discrete 
models have been reconstructed from various types of measurement data. However, 
a rigorously analyzed technique to construct a smooth manifold from these discrete 
models to complete the construction has been missing until now. 

In practice the measurement data contains always measurement errors and is 
limited. This is why the problem of the approximate reconstruction of a Riemannian 
manifold and the metric on it from discrete or noisy data is essential for several 
geometric inverse problems. Earlier, various regularization techniques have been 
developed to solve noisy inverse problems in the PDE-setting, see e.g. E3 Eg, 
but most of such methods depend on the used coordinates and, therefore, are not 
invariant. One of the purposes of this paper is to provide invariant tools for solving 
practical imaging problems. 

An example of problems with limited data is an inverse problem for the heat ker¬ 
nel, where the information about the unknown manifold (M, g) is given in the form 
of discrete samples (h,M(xj ,yk,ti))j,kej,iei of the heat kernel h,M(x,y,t), satisfying 

(d t - A g )h M (x, y, t) = 0, on (x, t) G M x R+, 
h M {x,y,0) = 8 y {x), 

where the Laplace operator A g operates in the x variable, see e.g. [35]. Here 
Vi = x ii w here {xj : j G J} is a finite e-net in an open set H C M, while 
{ti : i € 1} is in e-net of the time interval (to, ti). It is also natural to assume that 
one is given measurements h^\xj,yk,ti) of the heat kernel with errors satisfying 
I h^\xj,yk,ti) — hM(x jt y k ,ti)\ < e. Several inverse problems for wave equation 
lead to a similar problem for the wave kernel GM{x,y,t) satisfying 

(df - A g )G M (x, y, t ) = 8 0 (t)6 y (x), on (z, t) G M x K, 
GM{x,y,t)=0, for t < 0, 

see e.g. |35j 051 In the case of complete data (corresponding to the case when 
s vanishes), the inverse problem for heat kernel and wave kernel are equivalent 
to the inverse interior spectral problem, see HZ3- In this problem one considers 
the eigenvalues Xt of —A s , counted by their multiplicity, and the corresponding 
L 2 (M)-orthonormal eigenfunctions, <pic(x) that satisfy 

-A g <Pk(x) = \ k tpk{x), x € M. 

In the inverse interior spectral problem one assumes that we are given the first N 
smallest eigenvalues, A&, k = 1,2 ,,N, and values ^ n \xj) at the e-net points 
{Xj ; j G J} C H, where \^™\xj) — tpk(xj)\ < e and 11 C M is open. It is 
shown in 0 0H] that these data determine a metric space (X,dx) which is a 8 
GH-approximation to the unknown manifold M, where <5 = 5(e,N;fl) tends to 0 
as e —> 0 and N —> oo. It should be noted that the above works deal with the 
case of manifolds with boundary and the Laplace operators with some classical 
boundary conditions, however, the constructions used there can be immediately 
extended to the case of a compact Riemannian manifold with data in a bounded 
open subdomain. Theorem [l] completes the solution of the above inverse problems 
by constructing a smooth manifold that approximates M. 

1.3. Interpolation of manifolds in Hilbert spaces. As already mentioned, in 
the proof of Theorem 0] we need to approximate a set in a Hilbert space by an 
n-dimensional submanifold (with bounded geometry). At small scale, the set in 
question should be close to affine subspaces in the following sense. 
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Definition 1.9. Let E be a Hilbert space, X C E, n G N and r, 8 > 0. We say 
that X is 5-close to n-flats at scale r if for any x G X, there exists an n-dimensional 
affine space A x C E through x such that 

(1.16) d H (XnB?(y), A x nB?(x)) <5. 

To formulate our result for the sets in Hilbert spaces, we recall some definitions. 
By a closed submanifold of a Hilbert space E we mean a finite-dimensional smooth 
submanifold which is a closed subset of E. One can show that a finite-dimensional 
submanifold M C E is closed if and only if it is properly embedded, that is the 
inclusion M ^ E is a proper map. 

Let M C E be a closed submanifold. The normal injectivity radius of M is the 
supremum of all r > 0 such that normal exponential map of M is diffeomorphic in 
the r-neighborhood of the null section of the normal bundle of M. Let r be the 
normal injectivity radius of M and assume that r > 0. Then for every x G U r (M ) 
there exists a unique nearest point in M. We denote this nearest point by Pm{x) 
and refer to the map Pm - U r (M) —> M as the normal projection. Note that the 
normal projection is a smooth map. 

Theorem 2. For every n G N there exists <jq = cr 0 (n) > 0 such that the following 
holds. Let E be a separable Hilbert space, X C E, r > 0 and 

(1.17) 0 < 5 < (Tor. 

Suppose that X is 5-close to n-flats at scale r (see Def. \W- Then there exists a 
closed n-dimensional smooth submanifold M C E such that: 

(1) d H (X, M) < 55. 

(2) The second fundamental form of M at every point is bounded by C5r~ 2 . 

(3) The normal injectivity radius of M is at least r/3. 

(4) The normal projection Pm'- U r / 3 (M) —>• M is globally C-Lipschitz, i.e. 

(1.18) \Pm{x) - P M (y)\ < C\x-y\ 
for all x,y G U r / 3 (M), and satisfies 

(1.19) \\d k x P M \\ < C n , k 5r~ k 
for all k > 2 and x G U r / 3 (M). 

The proof of Theorem [2] is given in Section [3] 

We note that an algorithm based on Theorem [2] that summarises also the main 
objects used in its proof, is given in Section 0 see also Fig. [3] 

In Remark l3. 141 below we show that the bounds in claims (2) and (3) in Theorem 
[2] are optimal, up to constant factors depending on n. Thus Theorem [2] gives 
necessary and sufficient conditions (up to multiplication of the bounds by a constant 
factor) for a set X C E to approximate a smooth submanifold with given geometric 
bounds. 

Notation. In (11.191) and throughout the paper, dff denotes the mth differential 
of a smooth map. The norm of the mth differential is derived from the Euclidean 
norm on E in the standard way. We extend this notation to the case to = 0 by 
setting d x f = f(x) for any map /. As usual, we define the C' m -norm of a map / 
defined on an open set U C E, by 

\\f\\c^(u) = sup max \\d x f\\. 

xGU 0</c<m 

In order to approximate a submanifold M as in Theorem (5J the set X must 
contain as many points as a C(5-net in M. This is an unreasonably large number of 
points when 5 is small. The following corollary allows one to reconstruct M from a 
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smaller approximating set. It involves two parameters e and <5 where e is a ‘density’ 
of a net and 5 is a ‘measurement error’. Note that 6 may be much smaller than e. 
A similar generalization is possible for Theorem [l] but we omit these details. 

Corollary 1.10. For every n £ N there exists ao = ao (n) > 0 such that the 
following holds. Let E be a Hilbert space, X C E, 0 < e < r/10 and 0 < S < aor. 
Suppose that for every x £ X there exists an n-dimensional affine subspace A x C E 
such that the set X n B r (x) is within Hausdorff distance S from an e-net of the 
affine n-ball A x fl B r {x). 

Then there exists a closed n-dimensional submanifold M C E satisfying proper¬ 
ties 2-f of Theorem 0] and such that X is within Hausdorff distance CS from an 
e-net of M. 

Proof sketch. Below, the symbol Z denotes the angle between n-dimensional affine 
subspaces of E. 

Consider the set X' = H B r (x)) C E. A suitably modified version of 

Lemma HT721 implies that Z(A x ,A y ) < CSr -1 for all x,y £ X such that \x — y\ < r. 
It then follows that X' is UJ-close to n-flats at scale r — C5. Now the corollary 
follows from Theorem [2] applied to A'. □ 

1.4. Surface interpolation and Machine Learning. The results of this paper 
solve some classical problems in Machine Learning. Next we give a short review on 
existing methods and discuss how Theorem 0] is applied for problems of Manifold 
Learning. 

1.4.1. Literature on Manifold Learning. The following methods aim to transform 
data lying near a d-dimensional manifold in an N dimensional space into a set 
of points in a low dimensional space close to a d-dimensional manifold. During 
transformation all of them try to preserve some geometric properties, such as ap¬ 
propriately measured distances between points of the original data set. Usually the 
Euclidean distance to the ‘nearest’ neighbours of a point is preserved. In addition 
some of the methods preserve, for points farther away, some notion of geodesic 
distance capturing the curvature of the manifold. 

Perhaps the most basic of such methods is ‘Principal Component Analysis’ 
(PCA), [59] 04] where one projects the data points onto the span of the d eigen¬ 
vectors corresponding to the top d eigenvalues of the ( N x N) covariance matrix of 
the data points. 

An important variation is the ‘Kernel PCA’ [57 where one defines a feature 
map $(•) mapping the data points into a Hilbert space called the feature space. 
A ‘kernel matrix’ K is built whose ( i,j) th entry is the dot product ($(aij), $(xj)) 
between the data points Xi,Xj. From the top d eigenvectors of this matrix, the 
corresponding eigenvectors of the covariance matrix of the image of the data points 
in the feature space can be computed. The data points are projected onto the span 
of these eigenvectors of this covariance matrix in the feature space. 

In the case of ‘Multi Dimensional Scaling’ (MDS) [58], only pairwise distances 
between points are attempted to be preserved. One minimizes a certain ‘stress 
function’ which captures the total error in pairwise distances between the data 
points and between their lower dimensional counterparts. For instance, a raw stress 
function could be £(||:Ej — Xj\\ — \\yi — yj\\) 2 , where Xi are the original data points, 
yi, the transformed ones, and \\xi — Xj ||, the distance between Xi,Xj. 

‘Isomap ’ [73j attempts to improve on MDS by trying to capture geodesic dis¬ 
tances between points while projecting. For each data point a ‘neighbourhood 
graph’ is constructed using its k neighbours ( k could be varied based on various 
criteria), the edges carrying the length between points. Now shortest distance 


12 


FEFFERMAN, IVANOV, KURYLEV, LASSAS, NARAYANAN 


between points is computed in the resulting global graph containing all the neigh¬ 
bourhood graphs using a standard graph theoretic algorithm such as Dijkstra’s. Let 
D = [ dij} be the nx n matrix of graph distances. Let S = [d|-] be the nx n matrix 
of squared graph distances. Form the matrix A = ^HSH , where H = / — n -1 ll J . 
The matrix A is of rank t < n, where t is the dimension of the manifold. Let 
A 5 = i HS y H , where [S Y ]ij = ||yi — y 3 || 2 . Here the yt are arbitrary f-dimensional 
vectors. The embedding vectors yi are chosen to minimize ||A — A y |. The opti¬ 
mal solution is given by the eigenvectors V\ , ... , Vt corresponding to the t largest 
eigenvalues of A. The vertices of the graph G are embedded by the t x n matrix 

Y = (yi, ...,y n ) = (y/xiv lt • ■ •, y/%v t ) T ■ 


‘Maximum Variance Unfolding’ (MVU) [75j also constructs the neighbourhood 
graph as in the case of Isomap but tries to maximize distance between projected 
points keeping distance between nearest points unchanged after projection. It uses 
semidefinite programming for this purpose. 

In ‘Diffusion Maps’ [37j, a complete graph on the data points is built and each 


II II 

edge is assigned a weight based on a gaussian: niq = e ^ . Normalization is 

performed on this matrix so that the entries in each row add up to 1. This matrix 
is then used as the transition matrix P of a Markov chain. P* is therefore the 
transition probability between data points in t steps. The d nontrivial eigenvalues 
A i and their eigenvectors Vi of P 4 are computed and the data is now represented 
by the matrix [Ai'iq, • • • , A^t'd], with the row i corresponding to data point a 

The following are essentially local methods of manifold learning in the sense that 
they attempt to preserve local properties of the manifold around a data point. 

‘Local Linear Embedding’ (LLE) [BT] preserves solely local properties of the data. 
Let Ni be the neighborhood of Xi, consisting of k points. Find optimal weights w.j 3 
by solving W := argminw \\%i — Yj =i w ij x j ll 2 > subject to the constraints (i) 
= 1) (ii) Wij > 0, (iii) wtj = 0 if j ^ N^. Once the weight matrix 
W is found a spectral embedding is constructed using it. More precisely, a matrix 
Y is is a t x n matrix constructed satisfying Y = argmin^ Tr{YMY T ), under the 
constraints Y 1 = 0 and YY T = nl t , where M = (I n — W) T (I n — W). Y is used 
to get a t-dimensional embedding of the initial data. 

In the case of the ‘Laplacian Eigenmap’ |3]> 46J again, a nearest neighbor graph 
is formed. The details are as follows. Let n t denote the neighborhood of i. Let 
W = (w-ij ) be a symmetric (nx n) weighted adjacency matrix defined by (i) = 0 

if j does not belong to the neighborhood of i\ (ii) = exp(||x» — Xj || 2 /2cr 2 ), if 
Xj belongs to the neighborhood of x. L . Here cr is a scale parameter. Let G be the 
corresponding weighted graph. Let D = (dij) be a diagonal matrix whose i th entry 
is given by (Wl)j. The matrix L = D — W is called the Laplacian of G. We seek 
a solution in the set of txn matrices Y = &rgmm Y -.YDY T =i t Tr (YLY T ). The rows 


of Y are given by solutions of the equation Lv = A Dv. 

Hessian LLE (HLLE) (also called Hessian Eigenmaps) HDj and ‘Local Tangent 
Space Alignment’ (LTSA) |8Tj attempt to improve on LLE by also taking into 
consideration the curvature of the higher dimensional manifold while preserving 
the local pairwise distances. We describe LTSA below. 

LTSA attempts to compute coordinates of the low dimensional data points and 
align the tangent spaces in the resulting embedding. It starts with computing bases 
for the approximate tangent spaces at the datapoints x 3 by applying PCA on the 
neighboring data points. The coordinates of the low dimensional data points are 
computed by carrying out a further minimization miriYi ,c, : U,11 Yj ./*, — LiQi\\ 2 . Here 
Y; has as its columns, the lower dimensional vectors, Jk is a ‘centering’ matrix, 
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Qi has as its columns the projections of the k neighbors onto the d eigenvectors 
obtained from the PCA and Li maps these coordinates to those of the lower di¬ 
mensional representation of the data points. The minimization is again carried out 
through suitable spectral methods. 

The alignment of local coordinate mappings also underlies some other methods 
such as ‘Local Linear Coordinates’ (LLC) [5^ and ‘Manifold Charting’ jTTj . 

There are also methods which map higher dimensional data points to lower di¬ 
mensional piecewise linear manifolds (as opposed to smooth manifolds). Under 
this restriction these methods produce optimal manifolds. The manifold is a sim- 
plicial complex in the case of Cheng et al [ 25 ] and a witness complex in the case of 
Boissonnat et al [5]. 

Each of the algorithms is based on strong domain based intuition and in general 
performs well in practice atleast for the domain for which it was originally intended. 
PCA is still competitive as a general method. 

Some of the algorithms are known to perform correctly under the hypothesis 
that data lie on a manifold of a specific kind. In Isomap and LLE, the manifold has 
to be an isometric embedding of a convex subset of Euclidean space. In the limit 
as number of data points tends to infinity, when the data approximate a manifold, 
then one can recover the geometry of this manifold by computing an approximation 
of the Laplace-Beltrami operator. Laplacian Eigenmaps and Diffusion maps rest 
on this idea. LTSA works for parameterized manifolds and detailed error analysis 
is available for it. 

1.4.2. Theorems^ and\^ and the problems of machine learning. The Theorem|T|ad- 
dresses the fundamental question, when a given metric space (X, dx), correspond¬ 
ing to data points and their ‘abstract’ mutual distances, approximate a Riemannian 
manifold with a bounded sectional curvature and injectivity radius. In the context 
of Theorem [TJ the distances are measured in intrinsic sense in M and X. 

Theorem [2] deals with approximating a subset of a Hilbert space E satisfying 
certain local constraints by a manifold having a bounded second fundamental form 
and normal injectivity radius. In the context of Theorem [2] the distances are 
measured in extrinsic sense in E. Such approximations have extensively been con¬ 
sidered in machine learning or, more precisely, manifold learning and non-linear 
dimensionality reduction, where the goal is to approximate the set of data lying in 
a high-dimensional space like E by a submanifold in If of a low enough dimension 
in order to visualize these data, see e.g. references of Section H .4. II 

The results of this paper provide for the observed data an abstract low-dimen¬ 
sional representation of the intrinsic manifold structure that the data may possess. 
In particular, the topology of the manifold structure is determined, assuming that 
the sampling density has been sufficient. As described in Section [3] the proof of 
Theorem 2] is of a constructive nature and provides an algorithm to perform such 
visualisation. Note that this algorithm starts with tangent-type planes which makes 
it distantly similar to the LTSA method in machine learning, see e.g. [5211ST] . In 
paper EU, the authors provide a method of visualization of a given data using 
a probabilistic setting. In comparison, Theorem [2] helps us visualize data in a 
deterministic setting. 

Another application of the results of the paper to machine learning deals with 
the spectral techniques to perform dimensionality reduction, see e.g. El II ID- Using 
the constructions of Section m we can associate with the data set not only the 
metric structure but also point measures and use the constructions of |22j to find 
approximately the eigenvalues and eigenfunctions of the sought for manifold. 

The results of this paper also have implications for a probabilistic model of the 
data. Thus both Theorem 1, which involves an abstract manifold, and Theorem 
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2 which involves an embedded manifold have implications for Machine Learning. 
We would be assuming that data is drawn using independent and identically dis¬ 
tributed (i.i.d.) samples from a probability distribution supported on a manifold 
with random, e.g. gaussian, noise. As the amount of data increases, with high 
probability, the Hausdorff distance of the set of corrupted samples to the manifold 
first decreases (if the noise is sufficiently small) and then increases. We stop draw¬ 
ing data at the point where the high-probability bound on the Hausdorff distance 
begins to increase, and fit a manifold to this data. 

2. Approximation of metric spaces 

In this section we collect preliminaries about GH and quasi-isometric approxi¬ 
mation of metric spaces. In subsections 12.31 and 12.41 we present algorithms that can 
be used to verify the assumptions of Theorems Q] and [2] 

2.1. Gromov-Hausdorff approximations. Let A be a metric space. Recall that 
the Hausdorff distance between sets A, B C A is defined by 

dn{A, B) = inf{r > 0 : A C U r (B) and B C U r (A)} 

where U r denotes the r-neighbor hood of a set. 

The Gromov-Hausdorff (GH) distance cIgh{X : Y) between metric spaces X and 

Y is defined as follows: for every e > 0, one has cIgh{X, Y) < e if and only if there 
exists a metric space Z and subsets X',Y' C Z isometric to X and Y, resp., such 
that d# (A', Y') < e. One can always assume that Z is the disjoint union of X and 

Y with a metric extending those of X and Y. 

The pointed GH distance between pointed metric spaces (X,xo) and ( Y,yo ) is 
defined in the same way with an additional requirement that dz{xo,yo) < s. See 
e.g. [SS §1.2 in Ch. 10] for details. 

Example 2.1 (Distorted net). Recall that a subset S' of a metric space X is called 
an e-net if U e (S) = X. Let S be an e-net in X and imagine that we have measured 
the distances between points of S with an absolute error e. That is, we have a 
distance function d' on S x S such that \d'(x, y) — d(x, y)\ < e for all x,y £ S. Then 
the GH distance between X and (S, d’) is bounded by 2e. This follows from the 
fact that the inclusion S X is an e-isometry from ( S,d') to (X,d), see below. 

Strictly speaking, the ‘measurement errors’ in this example may break the tri¬ 
angle inequality so that (S, d') is no longer a metric space. This can be fixed by 
adding 3e to all d'-distances. 

Let X,Y be metric spaces and e > 0. A (not necessarily continuous) map 
/: X —> Y is called an e-isometry if /(A) is an e-net in Y and 

\dv{f{x),f{y)) - d x (x, y)\ < e 

for all x,y £ X. Equivalently, an e-isometry is a (1, e)-quasi-isometry (cf. Definition 
CE3D- If dcH{X : Y) < e then there exists a 2e-isometry from X to Y, and conversely, 
if there is an e-isometry from X to Y then dGH(X,Y) < 2e, see e.g. [201 Corollary 
7.3.28]. Throughout the paper we use these facts without explicit reference. 

Clearly a (A, e)-quasi-isometry from a bounded space A is a ((A — 1 )D + e)- 
isometry where D = diam(A). This implies (11.611 and (11.71) . 

2.2. Almost intrinsic metrics. Here we discuss properties of d-intrinsic metrics 
and related notions from Definition 11.21 First observe that, if aq, X 2 , ■ ■ ■, x x is a 
d-straight sequence, then its ‘length’ satisfies 

JV-l 

F, d(xi,x i+ i) < d(xi,x N ) + (N - 2)6. 

2=1 


( 2 . 1 ) 


GEOMETRIC WHITNEY PROBLEM 


15 


This follows by induction from Q and the triangle inequality. 

The next lemma characterizes almost intrinsic metrics as those that are GH close 
to Riemannian manifolds. However manifolds provided by this lemma may have 
extremely large curvatures and tiny injectivity radii. 

Lemma 2.2. Let X be a metric space and S > 0. 

1. If there exists a length space Y such that dcH{X,Y) < 5, then X is 65- 
intrinsic. 

2. Conversely, if X is 5-intrinsic, then there exists a two-dimensional Riemann¬ 
ian manifold M such that dcH^X, M) < C6, where C is the universal constant. 

Proof. 1. By the definition of the GH distance, there exists a metric d on the 
disjoint union Z := XU Y such that d extends dx and dy and dn(X,Y) < S 
in ( Z,d ). Let x,x' £ X. Since dn(X,Y) < 6, there exist y,y' £ Y such that 
d(x,y) < 6 and d(x',y') < 8. Connect y to y' by a minimizing geodesic and 
let y = x/i, 1 / 2 , ■ • ■, vn = y' be a sequence of points along this geodesic such that 
d(yi,yi+i) < S for all i. For each i = 2, ...,N — 1, choose Xi £ X such that 
d(xi,yi) < 6. Then x,X 2 , ■ ■ ■ ,Xn- i,x’ is a 65-straight 35-chain connecting x and 
x'. Since x and x' are arbitrary points of X , the claim follows. 

2. Since we do not use this claim, we do not give a detailed proof of it. Here is 
a sketch of the construction. First, arguing as in EH Proposition 7.7.5], one can 
approximate X by a metric graph. If X is 5-intrinsic, the graph can be made GH 
(75-close to X. Consider a piecewise-smooth arcwise isometric embedding of the 
graph into R 3 and let M be a smoothed boundary of a small neighborhood of the 
image. Then M is a two-dimensional Riemannian manifold which can be made 
arbitrarily close to the graph and hence C5-close to X. □ 

Now we describe a construction that makes a (75-intrinsic metric out of a metric 
which is 5-close to R" at scale r (see Definition ll.il) . More generally, let X = ( X , d) 
be a metric space in which every ball of radius r is 5-intrinsic, where r > 5 > 0. 
For x, y £ X , define the new distance d! {x, y) by 



( 2 . 2 ) 


where the infimum is taken over all finite sequences X\,..., Xx connecting x to y 
and such that every pair of subsequent points Xi,Xi+± is contained in a ball of 
radius r in ( X , d). 

In order to avoid infinite cf-distances, we need to assume that any two points can 
be connected by such a sequence. If this is not the case, X divides into components 
separated from one another by distance at least r. For our purposes, such compo¬ 
nents are unrelated to one another just like connected components of a manifold. 

Lemma 2.3. Under the above assumptions, the function d' given by m is a 
CS-intrinsic metric on X, where C is a universal constant. Furthermore, d and d' 
coincide within any ball of radius r. 

Proof. The triangle inequality for d implies that d' is a metric, d! > d, and d!(x, y) = 
d{x,y) if x and y belong to an r-ball in (X,d). It remains to verify that (X,d r ) 
is (75-intrinsic. Let x,y £ X and let x = x\,...,xx = y be a sequence almost 
realizing the infimum in (12.21) . Every pair Xi,Xi+i belongs to an r-ball in (X,d). 
Since this ball is 5-intrinsic, there exists a 5-straight 5-chain connecting Xi to Xi+i. 
Joining such chains together yields a 5-chain connecting x to y. Using the triangle 
inequality, one can easily verify that this chain is 105-straight. □ 

The next lemma shows that if a map is almost isometric at small scale then it is 
a quasi-isometry with small constants. It is used in the proof of Theorem [T] 
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Lemma 2.4. Let r > 55 > 0. Let X and Y be S-intrinsic metric spaces and 
f:X^Y a map such that f(X) is a 5-net in Y and 

(2-3) \dY{f(x),f(y))-d x (x,y)\ <5 

for all x, y £ X such that 

min {d x {x,y),d Y (f(x), f(y))} < r. 

Then f is a (1 + Cr~ * l 5, C5)-quasi-isometry, where C is a universal constant. 
Proof. Let x,x' £ X and D = d.\(x,x'). We have to verify that 

(2.4) (1 + CY-M)- 1 ^ - C5 < d Y {f(x), f{x')) < (1 + Cr~ 1 5)D + C6 

for a suitable C. Fix x, x' £ X and connect them by a (5-straight 5-chain (see 
Definition Oil . This chain contains a subsequence x = Xi,X 2 , ■ ■ ■ ,Xn = x' such 
that i — 5 < dx{xi, Xi+i) < r for all i = 1,..., N — 2 and dx{xx- i, xn) < r. Since 
the subsequence is also 5-straight, by EH) we have 

(2.5) 53dx(*i,Xj+i)<I> + (JV-2)5. 

Since dx{xt, ay+i) > r — 5 for each i < N — 2, the left-hand side is bounded below 
by (N — 2)(r — 5). Hence 

(2.6) N < (r — 25)~ 1 D + 2 < 2r~ x D + 2. 

By (12.31) we have dy(f(xi), /(x*+i)) < dx{xi, Xi+i) + 5 for all i. Therefore 
^dy(/(xj),/(xi+1)) < '^2d x {x i ,x i+ 1 ) + (N - 1)5 < D + (2N - 3)5. 
by (12.51) . Thus 

d Y (f(x), /(a/)) < D + (2N - 3)5 < D + (Ar^D + 1)5 = (1 + 4r~ 1 5)D + 5 

where the second inequality follows from (12.61) . This proves the second inequality 
in (15~fl . To prove that first one, apply the same argument to an ‘almost inverse’ 
map g:Y X constructed as follows: for each y £ Y, let g{y) be an arbitrary 
point from the set f~ 1 (Bs(y)). □ 

2.3. Verifying GH closeness to the disc. Here we present an algorithm that 
can be used to verify the main assumption of Theorem [T] Namely, given a discrete 
metric space X, n € N and r > 0, one can approximately (i.e., up to a factor 
C = C(n )) find the smallest 5 such that X is 5-close to R n at scale r (see Definition 
EH- Due to rescaling it suffices to handle the case r = 1. 

Thus the problem boils down to the following: given a point Xq € X , find 
approximately the (pointed) GH distance between the metric ball B*{x o) C X 
of radius 1 centered at Xq and the Euclidean unit ball C K”. The following 
algorithm solves this problem. 

Algorithm GHDist: Assume that we are given n, the point Xo £ X and the ball 

X 0 = Bi(x o) C X. We regard A 0 as a metric space with metric d = dx \ x 0 x x 0 ■ 
We implement the following steps: 

(1) Let x\ £ Xq be a point that minimizes |1 — d(x o,x)| over all x £ Xq. 

(2) Given x±,X 2 , ■ ■ .x m for to < n, we define the coordinate function 

(2.7) fm{x) = \ (d{x, x 0 ) 2 3 - d(x, x m ) 2 + d(x 0 , x m ) 2 ) 

(3) Given X\,X 2 : ■ ■ ■ x m and coordinate functions fi(x), f 2 (x), ..., f m (x) for 
to < n — 1, choose x m +i that is the solution of the minimization prob¬ 
lem 

min K m (x), K m {x) = max(|l - d(x 0 ,x) |, |/i(x)|,..., |/ m (x)|). 
x&X 0 
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(4) When xi,x%,.. .x n and coordinate functions fi(x), /2 (re),..., f n {x) are de¬ 
termined, we compute for /( x) = (fm(x))m=i 


5i = sup d(x',x)-\f(x')-f(x)\ , S 2 = sup inf \f(x) — y\, 

x,x’eX 0 X £ x 0 

5 a = max(<5i, J 2 )- 

Then the algorithms outputs the value of 5 a and the map /. 


Lemma 2.5. Suppose there exists a 5-isometry h: Xq — > B " C R” satisfying 
h{x 0 ) = 0. Then 

(1) The output value 5 a of the above algorithm satisfies 5 a < C5. 

(2) The map f: Xq —> R™ obtained in the algorithm is a C5-isometry from Xq 
to the Euclidean ball B™, s . 

(3) Moreover there exists an orthogonal map U : R” —> R™ such that 

\f(x) - U(h(x ))| < C5 

for all x £ Xq. 

Here C is a constant depending on n. 


Proof. It follows from the definition of 5 a that / : Xg —>• R n is a (5 a -isometry from 
Xq to B™ +s . Thus, the second claim follows from the first and third claims. Let us 
proceed with their proofs. 

As h is a (5-isometry, for any y\ £ dB ” there is x\ £ X 0 such that \h{x'f) — y\ \ < 5 
and hence |/i(a; , 1 )| > 1 — 5. Using again the fact that h is a (5-isometry, we see 
that d{x' 1 ,x 0 ) > 1 — 2 5. Hence, the point x\ chosen in the algorithm satisfies 
d(xi,Xo) >1 — 2(5 and |/i(a;i)| > 1 — 3J. 

Assume now that we have constructed, using the algorithm, the points Xi,, x m , 
m < n, the corresponding fi(x), i = 1,..., m, see m , and 

hi(x) := (h(x), h{xi)) = \{\h(x)\ 2 - \h(x) - h(xi)\ 2 + \h(xi)\ 2 ), i = l,...,m, 

where (•,•) is the inner product of R n . As h is a (5-isometry, we have, for some 
C = C(n), 

(2.8) \h m (x) - fm(x)\ < CS, X £ Xq. 

Moreover, assume next that for i,k £ {1, 2,..., m}, i fc, we have 

(2.9) \h( Xi )\ > 1 - C6, \hi(x k )\ = \(h(xi),h{x k ))\ < C5. 

Then, let y m +i £ dB " D {h(x 1 ),..., h(x m )}^. Then there is x' m+1 £ X 0 such that 
\H x 'rn+i) - Vm+ 1 | < 5 . This yields that 

(2.10) \Kx ' m+1 )| > 1 - <5. 

Moreover, 

\hi{x' m+ i)\ = \{h(xi), h(x' m+1 ))\ <C5, i = l,...,m. 

Due to (12.81) . (12.101) . the above inequality implies that K m+ 1 «, +1 ) < C5. This 
implies that the minimizer x m +i of K m+ 1 also satisfies K m+ i(x m +i) < C5. As h 
is a (5-isometry, it follows from m that (12.81) remains valid for i = m + 1. In turn, 
these imply that (12.91) is valid also for i, k £ {1,2,..., m + 1}, i ^ k. By induction, 
(EH) is valid for all i, k £ { 1 , 2 ,..., n}, i 7 ^ k. 

Applying Gram-Schmidt algorithm to vectors h(xj) and formula (12.91) . we see 
there is an orthonormal basis (wj)f =1 of R n such that \wj — h(xj)\ < C5. Hence, 
using formula (12.81) . we see that Ay = a li near operator 
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A : R" —> R” that satisfies ||A — J|| < CS. Let U: R” —> R" be the orthogonal 
linear map which maps (u> m )^ =1 to the standard basis (e m )^ =1 of R". By <12^1) . 

n 

\UA(h(x)) - f(x )I = | ^2 ( h m{x ) - fm{x))e m | < CS 

m= 1 

for all x £ Xq. The 3rd claim of the lemma follows. 

As h is a (5-isometry, this also proves that / is a C(5-isometry from Xq to a ball 
Bp where p = 1 + CS. The 1st claim of the lemma follows. □ 

The above lemma implies that the (pointed) Gromov-Hausdorff distance between 
Xq and B " satisfies 

C- 1 2 * * * * 5 a <d GH (X 0 ,B?)<26 a . 

Thus the algorithm GHDist gives the Gromov-Hausdorff distance of Xo and R™ up 
to a constant factor C depending only on dimension n. 

Remark 2.6. As a by-product of Lemma l2.5l we get the following fact: if hi, /12 ■ Xq —> 
B " are two 5-isometries, then there exists an orthogonal map U: R" R” such 
that \hi(x) — U(h 2 (x))\ < CS for all x £ X 0 . The map U can be constructed by a 
modification of the algorithm GHDist , see the proof of Lemma 12.51 

2.4. Learning the subspaces that approximate the data locally. Let X be 

a finite set of points in E = R w and X GB\(x) := {x,x \,..., x s } be a set of points 
within a Hausdorff distance S of some (unknown) unit n-dimensional disc D\(x) 
centered at x. Here B\{x) is the set of points in R w whose distance from x is less 
or equal to 1. We give below a simple algorithm that finds a unit n-disc centered 
at x within a Hausdorff distance CnS of Xq := X fl B\{x), where C is an absolute 
constant. 

The basic idea is to choose a near orthonormal basis from Xq where x is taken 
to be the origin and let the span of this basis intersected with B\ (x) be the desired 
disc. 

Algorithm FindDisc: 

(1) Let X\ be a point that minimizes |1 — \x — cc 7 || over all x' £ Xo. 

(2) Given x ±,... x m for m < n — 1, choose x m +i such that 

max(|l -\x- x'\\, |(xi/|a;i|,a; , )|,..., \{x m /\x m \, x')\) 
is minimized among all x' £ Xo for x' = 

Let A x be the affine n-dimensional subspace containing x,x±,... ,x n , and the unit 
n-disc Di(x) be A x fl B\{x). Recall that for two subsets A, B of R^, du{A,B) 
represents the Hausdorff distance between the sets. The same letter C can be used 
to denote different constants, even within one formula. 


Lemma 2.7. Suppose there exists an n-dimensional affine subspace A x containing 
x such that D\{x) = A x HBi(x) satisfies dn(Xo, Di{x)) < S. Suppose 0 < S < 

Then dn{Xo, D\{x)) < CnS , where C is an absolute constant. 


Proof. Without loss of generality, let x be the origin. Let d(x, y) be used to denote 
\x — y\. We will first show that for all m < n — 1, 


max ( |1 - d(x,x m+ i) 


Xl 

kil 


j" 5 ^m+1 


j" i 1 


< 5. 


To this end, we observe that the minimum over D\(x) of 


( 2 . 11 ) 


max ( |1 - d(x, y)\, 


/(xi) 


Fl 


( x m ) 
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is 0, because the dimension of D\(x ) is n and there are only m < n — 1 linear 
equality constraints. Also, the radius of D\(x) is 1, so 11 — d(x, z m+ \)\ has a value 
of 0 where a minimum of (12.1111 occurs at y = z m +Since the Hausdorff distance 
between D\ ( x ) and Xq is less than S there exists a point y m + 1 £ Xq whose distance 
from z m +i is less than S. For this point y m +i, we have <5 greater than 


( 2 . 12 ) 


max 


^|1 - d(x,y m+1 )\, 



2/ra+l J 



Since 


max 


^|1 d^XjXfji- i_i)| ? 



^m+l 



^m+1 J 


is no more than the corresponding quantity in (|2.12[) , we see that for each ra+1 < n, 


max 


^|1 - d(x,x m+1 )\, 



^m+1 



< <5. 


Let V be an N X n matrix whose i th column is the column Xi. Let the operator 
2-norm of a matrix Z be denoted \\Z\\. For any distinct i, j we have \{xi,Xj)\ < <5, 
and for any i, \(xi,Xi) — 1| < 26, because 0 < 1 — S < \xi\ < 1. Therefore, 


\\V t V — /|| < CmS- 


Therefore, the singular values of V lie in the interval 


Ic = (exp(— CnS), exp(CVn5)) D (1 — C\nS, 1 + C\n5). 

For each i < n, let x\ be the nearest point on D\{x) to the point x^. Since the 
Hausdorff distance of Xq to D\(x) is less than 8, this implies that \x\ — Xi\ <6 for 
all i < n. Let V be an N xn matrix whose i th column is x[. Since for any distinct 
i,j \{x[, x'j) | < 3<5 + 6 2 , and for any i, \{x[, x[) — 1| < 4(5, 

HV^V" — i'll < CnS. 

This means that the singular values of V lie in the interval Ic- 

We shall now proceed to obtain an upper bound of CnS on the Hausdorff distance 
between Xq and D\(x). Recall that the unit n-disc D\(x) is A x D Bi(x). By the 
triangle inequality, since the Hausdorff distance of Xq to D\(x) is less than 5 , it 
suffices to show that the Hausdorff distance between Di (x) and D\(x) is less than 
CnS. 

Let x' denote a point on D\ (x). We will show that there exists a point z' G D\ (x) 
such that \x' — z'\ < CnS. 

Let Va = x'. By the bound on the singular values of V, we have |a| < exp(Cn<5). 
Let y' = Va. Then, by the bound on the singular values of V, we have \y'\ < 
exp(Cn(5). Let z' = y'/\y'\. By the preceding two lines, z' belongs to D\(x). We 
next obtain an upper bound on \x' — z'\ 

(2.13) \x'-z' | < \x'-y'\ 

(2.14) +\y' - z'\. 

We examine the term in (12.131) 

\x' — y'\ = \Va — Va\ < sup |x* — — SnexTp(CnS). 

i 

i 

We next bound the term in (12.141) . 


| y' — z'\ < \y'\(l — exp(— CnS)) < CnS. 
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Together, these calculations show that 

\x' — z'\ < CnS. 

A similar argument shows that if z" belongs to D\(x) then there is a point p' € 
D\(x) such that | p' — z"\ < CnS ; the details follow. Let V(3 = z". From the bound 
on the singular values of V, |/3| < exp [CnS). Let q' := Vf3. Let p' := 


< 

w - 

z"\ + \ P '-q'\ 

< 

\vp 

-Vf}\ + \1 — V/3\ 

< 

sup| 

i 


< 

5nexp(Cn5) + C6n 

< 

CSn. 



This proves that the Hausdorff distance between Xq and Di[x) is bounded above 
by CnS where C is a universal constant. 

□ 

3. Proof of Theorem [2] 

The statement of Theorem[2]is scale invariant: it does not change if one multiplies 
r and S by A > 0 and applies a A-homothety to all subsets of E. Hence it suffices 
to prove the theorem only for r = 1. When r = 1, the theorem turns into the 
following proposition (where ao is renamed to <5o) : 

Proposition 3.1. There exists <5o = So(n) > 0 such that the following holds. Let E 
be a separable Hilbert space, X C E and 0 < <5 < <5o ■ Suppose that for every x € X 
there is an n-dimensional affine subspace A x C E through x such that 

(3.1) d H (X n Bx(x),A x n Bi(x)) < 6. 

Then there is a closed n-dimensional smooth submanifold M C E such that 

1. d H (X, M) < 5(5. 

2. The second fundamental form of M at every point is bounded by CS. 

3. The normal injectivity radius of M is at least 1/3. 

/. The normal projection Pm - —> M satisfies 

(3.2) \Pm(x) - P M (y)\ < C\x- y\ 
and 

(3.3) \\d k x P M \\ < C n , k 5 

for all x, y £ and all k> 2. 

The proof of Proposition ^.ll occupies the rest of this section. Let X and {A x } xe x 
be as in the proposition. Let Pa x : E —> A x be the orthogonal projection to A x . 
By A x we denote the linear subspace parallel to A x . For x € X and p > 0, we 
define B*(x) = X fl B p (x) and D p (x) = A x fl B p (x). In this notation, m takes 
the form 

(3.4) dn{Bf (x), Di{x)) < 6, x e X. 

In the sequel we assume that S is sufficiently small so that the inequalities arising 
throughout the proof are valid. The required bound for S depends only on n. 

Lemma 3.2. Letp,q £ X be such that \p — q\ < 1. Then dist(g,A p ) < S and 
Z(A pi A q ) < 55. 
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Proof. Since q £ Bff (p ), we have 

dist (q,A p ) < dist(g, Di{p)) < d H (B* (p), D 1 (p)) < 5 

by (13.dl) . It remains to prove the second claim of the lemma. 

Let z = Pa p ()■ Then \z — p\ < \ and \z — q\ < \ + 5 by the triangle 
inequality. Define B = A p fl Bi/ 2 _ 2 s(z). We claim that dist(y, A g ) < 25 for every 
y £ B. Indeed, let y £ B. Then \y — q\ < 1 — 5 and \y — p\ < 1 — 25. The latter 
implies that y £ Di(p), hence by (13.dl) there exists x £ X such that \x — y\ < 5. By 
the triangle inequality we have x £ B 4 (q), hence (13.41) implies that dist(x, A q ) < 5. 
Therefore dist(y,A q ) < \y — x\ + dist(x, A g ) < 25 as claimed. 

Define a function h: A p —>■ K+ by h{v) = dist (2 + v,A q ) 2 . As shown above, 
h{v) < 45 2 for all v £ A p such that |u| < | — 25. The function h is polynomial of 
degree 2, i.e., h(v) = Q(v ) + L(v) + ho where Q is a (nonnegative) quadratic form, 
L is a linear function and ho = h{ 0). Furthermore, 

Q(v) = sin 2 Z(v, A q ) ■ \v\ 2 

for all v £ A p . Let a = Z(A p , A q ), and let vq £ A p be such that Z(v 0 , A q ) = a and 
| no | = \ — 25. Then 

QM = hM + h(-VQ) _ m s 4tj2 

since /i(±n 0 ) < 4<5 2 and /i(0) > 0. Thus sin 2 (o;) • |no| 2 < 45 2 , or, equivalently, 
sina < 2(5(| — 25) _1 = 4J(1 — 45) -1 . 

If 5 is sufficiently small, this implies the desired inequality a < 55. □ 

Let Xq be a maximal (with respect to inclusion) yT,-separated subset of X. Note 

that Xq is a y^y-net in X and Xo is at most countable. Let Xq = {qi}[=°i ■ For 
brevity, we introduce notation A, = A qi and Pi = Pa p . ■ 

Throughout the argument below we assume that |Xo| = 00 , i.e. Xq is a countably 
infinite set. In the case when Xo is finite the proof is the same except that ranges 
of some indices should be restricted. 

Assuming that 5 < , there is a number N = N(n) such that every set of the 

form Xo D B\(qi) contains at most N points. This follows from the fact that this 
set is y^Q-separated and contained in the 5-neighborhood of a unit n-dimensional 
ball Di(qi). 

Fix a smooth function /j : M + —> [0,1] such that p{t) = 1 for all t £ [0, y] and 
p(t) = 0 for all t > For each i > 1 define a function y,i: E —> [0,1] by 

Pi{x) = p(\x - qi\). 

Clearly /j, is smooth and ||d^/Zj|| is bounded (by a constant depends only on n 
and k) for every k > 1. Let pn : E —> E be a map given by 

(3.5) Pi(x) = pi{x)Pi{x) + (1 - pi{x))x. 

Now define a map fi: E —> E by 

(3.6) fi = <Pi o Pi -1 0 • ■ • o Pi 
for all * > 1, and let fo = icIe- 

For x £ E and i > 1 we have fi(x) = fi- i(x) if |/,;_i(x) — g,| > \. This follows 
from the relation fi = pi o /j_i and the fact that ifi is the identity outside the ball 
B1/2 

Let U = L r 1 / 4 (X 0 ) C E. We are going to show that for every x £ U the sequence 
{fi(x)} stabilizes and hence a map / = lirn^oo fi is well-defined on U. 

Define B m = B 1 / 4 (q rn ) for m = 1,2,-Note that U = \J m B m . 
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Figure 1. A schematic visualisation of the interpolation algo¬ 
rithm Algorithm ‘Surfacelnterpolation’ based on Theorem see Sec¬ 
tion [5] In the figure on top, the black data points X C E = R m 
have a ^-neighbourhood U = Us{X). The boundary of U is marked 
by blue. In the figures below, we determine, near points Xi € X, 
i = 1, 2, 3 the approximating n-dimensional planes A*, marked by 
red lines. Then we map the set U by applying to it iteratively 
functions ipi : E E, defined in (13.51) . The maps <fii are convex 
combinations of the projector Pa*, onto A*, and the identity map. 
Figures 2,3, and 4 from the top show the sets ipi(U), <P 2 (<Pi(U)) 
and <P 3 (‘fi 2 ( l Pi(U ))), respectively. The limit of these sets converge 
to the n-dimensional surface M C E. 


Lemma 3.3. If x £ B m then \fi(x) — q m \ < ^ for all i. 

Proof. Suppose the contrary and let 

i 0 = min{* : \fi{x) - q m \ > 5 }. 

Let i < io be such that \qi — q m \ < 1 and let 2 = /*_i(x). Since i — 1 < zo, 
we have \z ^ q m \ < Lemma 13.21 applied to p = qi and q = q m implies that 
\Pi{z) — P m {z)\ < 6(5. Since P m is the orthogonal projection to a subspace containing 
q mi we have \P m (z) - q m \ < \z - q m |, therefore 

I Pi(z) - q m | < | Pm(z) - q m I + I Pi{z) - Pm(z )I <\z- q m \ + 6 S 

and hence the point 

fi{x) = ipi(z) = Hi{z)Pi{z) + (1 - Hi(z))z 

satisfies 

| fi{x) - q m | < Hi(z)\Pi(z) - q m | + (1 - Hi(z))\z - q m \ <\z- q m \ + 6(5. 

Thus 

(3.7) \fi( x ) — Qm\ < \fi-l(x) — Qm\ + 6(5 

for all i < io such that \qi — q m \ < 1. For indices i < io such that \qi — q m \ > 1, we 
have 


\fi~i(x) -qi\>l- \ fi-i{x) -q m I > 1 - | > 5 
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and hence fi{x) = fi- i(x). Since there are at most N = N{n ) indices i < i o such 
that \qi — q m \ < 1, by (Id.711 it follows that 

\fi 0 (x) - q m | < \x- q m | +6A^5 < |x- q m \ + ^ < 5 , 
provided that S < 1/1207V. This contradicts the choice of io- □ 


Lemma f3.3l implies that there exists only finitely many indices i such that /* \ 

Indeed, if fi(x ) ^ fi-i(x) for some x £ B m , then | q t - q m \ < 1 because 
\fi-i(x) — q m | < -| by Lemma T3.31 and |/j_i(x) — qi\ < \ (since pi is the identity 
outside Bi/ 2 {qi))- Thus the sequence {fi\ Bm }iZi stabilized and hence the map 

(3.8) /(x) = lim fi(x ) 

i—too 

is well-defined and smooth on B m . Since m is arbitrary, / is well-defined and 
smooth on U = U m B m • 


Lemma 3.4. For every q m £ Xq and every k > 0 we have 

(3.9) ||/i - Pm\\c k (B m ) < C5 for all i > m, 
and therefore 

(3.10) \\f - PmWcHBm) <C6. 

Proof. Let I m = {i : \qi — q m \ < 1} and let ji < ■ ■ ■ < jN m be all elements of I m . 
Recall that N m = \I m \ < N = N(n). As shown above, Lemma 13,31 implies that pi 
is the identity on fi-i(B m ) for i ef I m . Therefore for every i we have 

(3-11) fi\B m = <Pj lw o < o...op h \ Bm 

where l(i ) = maxjfc : jk < *}. 

We compare pi and fi with maps pi and fi defined by 
Pi = Pi{x)P m {x) + (1 - Pi{x))x, 


and 


fi=Pi O Pi-1 o ...o pi. 

By induction one easily sees that 

(3.12) fi(x) = A i{x)P m {x) + (1 - A i(x))x 

for some \{x) £ [0,1], Ai(x) < A 2 (x) <_ Therefore fi(B m ) C B m for all i. 

Similarly to the case of fi this implies that 

(3-13) fi\ Bm = o Pj Ki) _ x o . .. o p h \ Bm 

By Lemma T3. 2 1 for every i £ I m we have 

||P(x) - P m (x)|| < CS, || d x Pi - d x P m || < C6 
for all x £ Bi(q m ) and therefore 

\\Pi — 7 , i|lc fe (Bi(g m )) = II h’i ' (Pm — Pi) \\c k (Bi(q m )) — ^ A 

This estimate, (13.111) . (13.131) and the fact that l{i) < \I m \ < N{n) imply that 


(3-14) Wfi-fA\c*{B m )<C5 

for all i and k > 0. Observe that p m \s m = P m \ Bm since p m = 1 on B m . This fact 
and relations (13.121) imply that fi\ Bm = P m |s m for all i > m. Therefore for i > m 
the estimate (13.111) turns into (13.91) and the lemma follows. □ 


Lemma 3.5. fm{B 

m ) C D 1/3 (q m) • 

Proof. Let x £ B m and y = f m - i(x), then / m (x) = p m {y)- By Lemma E31 
\y - q m \ < Therefore p m (y) = 1 and hence p m (y) = P m (y)- Thus f m (x) = 
Pm(y) £ Pl/3 (dm)- f-1 
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By definition, / = g o f m for some smooth map g: E -A E. Therefore f(B m ) is 
contained in an image of an n-dimensional disc Di/ 3 {q m ) under a smooth map g. 

Lemma 3.6. f(B m ) C Uis(D 1 / 3 (q m )) for every m, and f(U) C U 5S (X). 

Proof. Let x £ B m . By Lemma [3.31 we have fi(x) £ B 1 / 3 (q m ') for all i. Let us 
show that fi(x) £ U 4 s(A m ) for all i > m. This is true for i = m since f m (x) £ 
B>i/ 3 {qm) C A m by Lemma T3.51 Arguing by induction, let * > m and assume that 
y = fi-i{x) £ U 4 s(A m ). If | y - qi\ > then f m (x) = y £ U 4 s{A m ), so we assume 
that \y — qi\ < \- Note that 

I <n - q m \ < \q m - y\ + \y - *1 < § + \ < l- 
By definition, the point f%[x) = tfii(y) belongs to the line segment [ yz\ where z = 
Pi(y). Since z £ Ai and \qi — z | < \qi — y\ < we have 

dist(z, A m ) < distfe, A m ) + \ sin Z(A il A m ) < S + f<5 < AS 

where the second inequality follows from Lemma 13.21 Thus z £ U 4 s(A m ). Since 
fi{x) £ [yz\, both y and z belong to U 4 s(A m ) and U 4 s(A m ) is a convex set, fi(x) £ 
U 4 s(A m ) as claimed. 

Thus fi(x) £ U 4 s(A m )r\Bi/ 3 (q m ) for all x £ B m and all i > m. This implies the 
first claim of the lemma. To prove the second one, recall that Di(q m ) C Us(X) by 
m- Hence f(B m ) C U 4 s(D 1 / 3 (q m )) C U$s{X). Since m is arbitrary, the second 
assertion of the lemma follows. □ 

Now define 

(3.15) M = f(U 1/5 (X 0 )). 

We are going to show that M is a desired submanifold. 

Lemma 3.7. For every y £ M there exists q m £ X 3 such that \y — q m \ < jA. _|_ 5 ^ 
and 

M fl B 1 / W0 (y) C f{D 1/w {q m )). 

In particular, M = (J m f{D 1 / w {q m )). 

Proof. By Lemma [3.61 y £ U 3 s{X). Since Xq is a p^-net in X , there is point 
q m £ Xq such that |y — q m \ < jgp + 5<5. Let us show that this point satisfies the 
requirements of the lemma. Let W = M fl Bi/ W0 (y) and D = D 1 / W (q m ). We are 
to show that W C f{D). Fix a point z £ W. Observe that 

\z - Qm\ < \z - y\ + \y - q m \ < 1^0 + 1M + C5 =h + CS - 
Since z £ M, we have z = f(x) for some x £ Ui/ 5 (X 0 ). Let p £ X 0 be such that 
\x — p\ < |. Then |z — Pa p {x)\ < CS by Lemma [3~T1 On the other hand, 

\x-P Ap {x)\ <\x-p\<\. 

Therefore 

\x - q m | <\x- Pa p (x)\ + I z- Pa p (x) \ + \z - q m \ < \ + CS + ^ + CS < 
thus x £ B m . 

By Lemma [3~fl it follows that | z — P m (x)\ = \f(x) — P m {x)\ < CS and \f m (x) — 
P m (x)\ < CS. Therefore | f m (x) — z\ < CS and hence 

| fm(x) ~ qm I < I fm(x) - z\ + \z - q m \ < + CS. 

By Lemma ECU we have f m ( x) £ A m , hence f m ( x) £ D 1 / 50+cs (q m ). 

Now consider the map f m \D- By Lemma 13.51 its image f m (D) is contained 
in A m . By Lemma 13.41 f m \D is C^-close to the projection P m |u, which equals 
idjo since D C A m . Thus f m \D is C(5-close to the identity and maps D to a 
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subset of the n-dimensional subspace A m . By topological reasons, see [B2], Thm. 
1.2.6], this implies that / m (D) contains an n-ball -Di/io-cat^m)- Since f m (x) £ 
i?i /50 -j-csiqm) C D 1 / io_ cs(Qm), it follows that there exists a point x' £ D such that 
fm{x') = f m {x). Since / factors through / m , this implies that f(x') = f(x) = z. 
Thus z £ f(D). Since z is an arbitrary point of W, the lemma follows. □ 

The next lemma shows that M is a submanifold and provides bounds for deriva¬ 
tives of a parametrization of M. 

Lemma 3.8. M is a closed n-dimensional smooth submanifold of E. Moreover for 
every y £ M there exists a smooth map p: V —> E, where V = B^ 1Q is the ball of 
radius yy in R”, such that y £ p(V) C M and p is CS-close to an affine isometric 
embedding in the C k -topology for any k > 0. 

Proof. Pick y £ M and let q m £ X 0 be as in Lemma HTTl Let D = Di/ 10 (q m ). By 
Lemma EP1 / \d is Cd-close to the identity in the C fc -topology. In particular, f(D) 
is an embedded smooth n-dimensional submanifold. By Lemma 13.71 

f(D) n B 1/100 (y) = In Bi/ W0 (y). 

Hence M D B 1 / W0 (y) is a submanifold for every y £ M and therefore M is a 
submanifold. 

To see that M is closed, recall that | y — q m \ < yyy + 5<5. Since /\d is C5-close to 
identity, this implies that the /-image of the boundary of D is separated away from 
y by distance at least yb — y^ — CS > y^. Therefore M n -Bi/ioo(y) is contained 
in a compact subset of the submanifold f(D). Since this holds within a uniform 
radius yyjg from any y £ M, it follows that M is a closed set in E. 

To construct the desired local parametrization ip, just compose /|d with an affine 
isometry between D and an appropriate ball in R". □ 

Note that the existence of local parametrizations that are C^-close to affine 
isometries (in the C 2 -topology) implies that the second fundamental form of M is 
bounded by CS. Let us verify the remaining assertions of Pronosition l3.il The first 
one is the following lemma. 

Lemma 3.9. dn(M,X) < 56. 

Proof. By Lemma [3.61 we have M C U^s{X). It remains to prove the inclusion 
X C U 5 s(M). Fix x € X and let q m £ X 0 be such that | q m — x\ < y^p. Consider 
the map P m o f\D 1/5 (q m ) from Di/ 5 (q m ) C A m to A m . By Lemma [3.41 this map 
is C5-close to the identity. Therefore its image contains the n-disc D 1 / b _ C5 (q m ). 
This disc contains the point P m {x) because 

| Bmixf) qm\ C \x qm\ f Jqq ^ 5 CS. 

Hence P m ( x) £ Pm(f(D 1 / 5 (q m ))). This means that there exists y £ Di/ 5 (q m ) such 
that P m (f(y)) = Pm(x). By Lemnm ITTTTT1 we have dist(/(y), A m ) < 45 and therefore 

I f(y) - P m (x )| = | f(y) - P m (f(y))\ < 4 5. 

By (13.41) we have dist(a;, A m ) < S and therefore \x — P m {x)\ < S. Hence 

I f(y) ~x\< | f(y) - P m (x) | + \x- P m {x )| < 45 + S = 55. 

Observe that f{y) £ M since y £ Di/ 5 (q m ) C Ui/ 5 (X 0 ). This and the above 
inequality imply that x £ U^{M). Since x is an arbitrary point of X , we have 
shown that X C U$s(M). The lemma follows. □ 
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Remark 3.10. We observe that 

(3.16) M = f{U$(X)) 

(compare with (13.15l) i. Indeed, we have M C (J m f{D 1 / 10 (q m )) by Lemma 1X71 and 
Di/ W (q m ) C Ug(X) b y JO ). 

One can think of (13.161) , (13.151) and the last claim of Lemma 13.71 as various 
reconstruction procedures for M. 

Lemma 3.11. \f(y) — y\ < CS for every y £ U$(X). 

Proof. Since y £ Ug(X), there isiel such that \x — y\ < 5. Pick q m £ X 0 such 
that \x — q m \ < Then y G B m and hence | f(y) — P m {y)\ < CS by Lemma [3~T1 
By dSHJ we have dist(x, A m ) < 5 and hence 

I y - P m (y) \ = dist(y, A m ) < 25. 

Therefore | f{y) - y\ < \f{y) - P m {y)\ + \y~ P m {y)\ <C5 + 25. □ 

Now we are in position to prove the third assertion of Proposition 13.II We are 
going to show that the normal injectivity radius of M is no less than | > | (in 
fact, any explicit constant smaller than ^ would do). Suppose the contrary, i.e., 
that the normal injectivity radius of M is less than |. 

First we observe that any relatively small part of M has large normal injectivity 
radius. More precisely, let k be an upper bound for the principal curvatures of M 
and let E be a ball of radius \k~ 1 centered at x £ M with respect to the intrinsic 
metric of M. Then the normal injectivity radius of E is greater or equal to (Ck)~ 1 . 
Indeed, for any point y £ E we have Z(T y S,T x S) < 1/2. It follows that E is a 
graph of a smooth function over a region in T x E, the first derivatives of this function 
are bounded by 1, and its second derivatives are bounded by Ck - 1 . One easily sees 
that this implies the lower bound ( Cn ) _1 for the normal injectivity radius of E. 
Taking into account that k < C5 , we conclude the normal injectivity radius of any 
intrinsic ball of radius (Ci5) _1 in M is bounded below by (Ci5) -1 . 

Hence the non-injectivity of the normal exponential map within normal distance 
| can occur only for points x,y £ M separated by intrinsic distance at least (C'<5) -1 . 
Thus there are points x,y £ M such that \x — y\ < i and d,M(x,y) > ( C5 ) -1 > 1 
where d,M is the intrinsic distance in M. We are going to show that these two 
inequalities contradict each other. 

Let x,y £ M be as above. Then by (13.161) there are points x',y' £ Ug{X) 
such that f(x') = x and f(y') = y. By Lemma 13.111 we have \x — x’\ < C5 and 
\y — y'\ < C5, hence \x' — y’\ < i + C5 by the triangle inequality. Let x",y" £ X 
be such that \x' — x"\ <5 and \y' — y"\ <5. Then 

\x" - y "| < \x' - y'\ + 25 < f + C5 < 1. 

Hence y" £ B*{x"). This and (13.41) imply that y" £ Us(Di(x")). Therefore both 
x' and y' and hence the line segment [x',y r ] are contained in the 2i5-neighborhood 
of the affine n-disc D\{x"). By (El, this neighborhood is contained in U^g{X). 
Thus [x',y'] is contained in U^g{X) and hence in the domain of /. Consider the 
/-image of the line segment [x , ,y / ]. It is a smooth path in M connecting x and 
y. Lemma I3HJ for k = 1 implies that f is locally Lipscliitz with Lipschitz constant 
1 + CS. Therefore 

length (/([a/, */])) < (1 + C5)\x' - y'\ < (1 + C<S)( | + CS) < 1. 

Hence c?m(*, 2/) < 1, a contradiction. This contradiction proves the third claim of 
Proposition 13.11 
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It remains to prove the fourth assertion of Proposition 13.11 By Lemma 13.81 M 
admits local parametrizations that are (75-close (in any (7 fe -topology) to affine iso¬ 
metric embeddings. This and the fact that the normal injectivity radius is bounded 
below by | imply that at every point x £ U 2 / 3 (M) the normal projection Pm is 
well-defined and its derivatives of any order are (75-close to those of the orthogonal 
projection to an affine subspace. This implies (13.31) . 

In order to prove m, consider the first derivative <I x Pm where x £ U 2 / 5 M. As 
shown above, it is (75-close to an orthogonal projection and hence is Lipschitz with 
Lipschitz constant close to 1. It follows that (13.21) holds (with C ~ 1) whenever 
the line segment [x,i/] is contained in t/ 2 / 5 (Af). This argument handles all pairs 
x,y £ U 1 / 3 (M) with \x-y\ < §-5 = yg. For x, y £ U 1 / 3 (M) such that \x-y \ > 
m follows from the fact that \x — Pm(x)| and | y — Pm(i/)| are bounded by | and 
therefore 

I Pm{x) - Pm{v )| < \x ~ y\ + | < H|x - y\. 

This finishes the proof of Proposition 13.II As explained in the beginning of this 
section, of Theorem [2] follows via a rescaling argument. 

Remark 3.12. The subspaces A x from Definition !!.Dl anproximate the tangent spaces 
of the submanifold M constructed in Theorem [2] More precisely, if x £ X and 
y = Pm(x), then the angle of A x and T y M satisfies 

(3.17) A(A x ,T y M)<CSr~ 1 

where T y M is the tangent space to M at y. 

To prove (13.171) . let A = T y M and consider the intrinsic ball D = B^j 2 (y). Due 
to the bound C5r ~ 2 on the second fundamental form, D is (75-close to the ball 
B^/ 2 (y) in the tangent space. On the other hand, since M is 55-close to X and 
X fl B r (X) is 5-close to A x , D is 65-close to A x . This implies that Pjy 2 (y) is 
(75-close to A x and (13.171) follows. 

Remark 3.13. Lemma l3.4l and the above arguments about Pm imply that 

11 / — PM\\c k (u 1/5 (x)) < CS 

for all k. Thus, for computation purposes, the explicitly constructed map / is as 
good as the normal projection Pm- 

Remark 3.14. Let us show that the constants in Theorem [2] are optimal, up to 
constant factors. Let M C E be a closed n-dimensional submanifold whose second 
fundamental form is bounded by K$ tr = ^5r~ 2 , with 0 < 5 < r < 1, and normal 
injectivity radius is bounded from below by 2r. Let x £ M. Using formula (O 
we see that 

(3.18) d H (BM(x),B% M ( x))<6. 

Here P 2 r ( x ) i s the intrinsic ball in M of radius 2r centered at x. 

Our assumptions on M imply that the normal projection Pm is well-defined 
and 2-Lipschitz in the ball B^(x). Hence for any z £ M D B^(x) the projection 
Pm([x, z ]) of the line segment [x, z] is a curve of length at most 2r. Therefore z = 
Pm(z) £ B^.(x). ThusMflP/'(x) C B^(x). Also note that B^ (x) C MflP/'(x). 
These relations, (13.181) and (11.21) imply that dn(M 0 P,?(x), Bj xM (x)) <5. As x 
above is an arbitrary point of M, we have that M is 5-close to n-flats at scale 
r. This shows that in Theorem [2] the bounds in claims (2) and (3) on the second 
fundamental form and the normal injectivity radius are optimal, up to multiplying 
these bounds by constant factors depending on n. 
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4. Proof of Theorem [T] 

Similarly to the proof of Theorem [21 we first observe that the the statement of 
Theorem [T| is scale invariant and it suffices to prove it for r = 1. When r = 1, 
Theorem [T] is equivalent to the following proposition with So = cro > 0. 

Proposition 4.1. For every positive integer n there exists So = So(n) > 0 such 
that the following holds. Let 0 < S < So and let X be a metric space which is S- 
intrinsic and S-close to R" at scale 1. Then there exists a complete n-dimensional 
Riemannian manifold M such that 

1. There is a (1 + C5, CS)-quasi-isometry from X to M. 

2. The sectional curvature Sec m of M satisfies | SecM | < CS. 

3. The injectivity radius of M is bounded below by 1/2. 

The proof of Proposition 14.11 occupies the rest of this section, which is split into 
several subsections. We recycle the letter r for use in other notation. We fix n 
and assume that a metric space X satisfies the assumption of the proposition for a 
sufficiently small S > 0. 

Fix a maximal -separated set Xq C X. We say that two points x, y £ Wo are 
adjacent if dx(x,y) < 1 and say that they are neighbors if dx(x,y) < y. 

The adjacency relation defines a graph which we refer to as the adjacency graph. 
The set of vertices of this graph is X 0 and the edges are between all pairs of adjacent 
points. We need the following properties of this graph. 

Lemma 4.2. 1. The adjacency graph is connected. 

2. Its vertex degrees are bounded by a constant depending only on n. 

Proof. 1. Let x,y 6 Xq. Since X is ^-intrinsic, there is a <5-chain xi,... ,Xn £ X 
with x\ = x and Xn = y. For each Xi, there is a point x\ £ Wo with dx(xi , x[) < yyy. 
By the triangle inequality, dx^xl , x' i+1 ) < 2<5 + W < 1 for all i, and we may assume 
that x' x = x and x' N = y. Then the sequence x[,..., x' N is a path connecting x to 
y in the adjacency graph. 

2. Let q £ X 0 . Since dff(Bi(q),B") < S, there exists a 2<5-isometry /: Bi(q) —> 
B ”. Let Y = X 0 nBi(q) be the set of points adjacent to q. Since Y is y^-separated, 
its image f(Y) is a (y^ — 2(5)-separated subset of B". We may assume that <5 is 
so small that y^, — 25 > ■ Then the cardinality of Y is no greater than the 

maximum possible number of 2 ^g-separated points in B □ 

Lemma S21 implies that the set Xq is at most countable. In the sequel we assume 
that Wo is countably infinite, Xq = In the case when Wo is finite, the proof 

is the same except that the indices are restricted to a finite set. 

4.1. Approximate charts. Fix a collection of points {pi}'fZ 1 in 1R" such that the 
Euclidean unit balls Di := Bifpf) are disjoint. For r > 0, we denote by the 
Euclidean ball B r (pi) C R". 

Recall that W 0 = i- F° r eac ^ * £ N we have dGH{Bi{qi) 1 D i ) < S since 

Di is isometric to 13". Recall that here we are dealing with pointed GH distance 
between between balls where the centers are distinguished points. Hence there 
exists a 2i5-isometry /,;: Bi(qi) —> Di such that fi(qi) = Pi- 

We fix 2(5-isometries /*: Hi (ft) —> Di , i £ N, for the rest of the proof. The 
balls Di and the maps fi play the role of coordinate charts in W. The next lemma 
provides a kind of transition maps between charts. 

Lemma 4.3. For each pair of adjacent points qi,qj £ Wo, there exists an affine 
isometry Aij : R n R" such that 

(4.1) |A i i(/i(*))-/i(*)|<C'<5 
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for every x £ B 1 (q i ) n B^qf). 

Proof. Let Y = Bi^qf) C\Bi(qj). Since dGH{B\{qi), 6 ") < S and qj £ Bi^qf), there 
exists xq £ Y such that 

max{d x (x 0 ,qi),d x (x 0 ,qj)} < \d x {q%,qj) + 2(5. 

The map fi\y is a C5-approximation from Y to intersection of Euclidean balls 
Z ■= Di nBi +2 {(/.(%)) C R”. By the choice of xo , Z contains the ball of radius | 
centered at fc(x o). Consider the map hi: Y R" defined by h\(x) = fi(x)—fi(x o). 
It is a C(5-isometry from Y to the set Z\ obtained from Z by the parallel translation 
by —fi(xo). Observe that - 61 / 3 ( 0 ) C Z\ C 6 2 (0). Similarly, the map h 2 :Y —> R" 
defined by h 2 (x) = fj{x) — fj(xo) is a C<5-isometry from Y to a set Z 2 C R” with 
similar properties. Note that hi(xo) = h 2 (xq) = 0. 

Arguing as in Lemma [23] (cf. Remark l2.6l) we see that there exists an orthogonal 
map U : R" —> R" such that 

(4.2) \U(h 1 {x))-h 2 (x)\ < CS 
for all a; £ Y. Now define Aij : R n —> R" by 

A ij{y) = U{y- fi(x 0 )) + fj(x 0 ), y £ R". 

This definition and (14.21) implies (14.11) . □ 

We fix maps Ay constructed in Lemma 14.31 for the rest of the proof. We may 
assume that A Jt = A ”- 1 for all i. j and An is the identity map. 

Lemma 4.4. Let qi,qj,qk £ Xo be three pairwise adjacent points. Then 

(4.3) | A jk (Aij(x)) - A ik (x )| < CS 
for all x £ Di. 

Proof. Consider the set Y = Bi(qi) D B\{qj) D Bi(q k ) C X. The map fi\y is a 
C(5-isometry from Y to the intersection of Euclidean balls 

Z := Di D 6i_|_ 2 i(a) D Bi +2 s(b) C R", 

where a = fi{qj) and b = fi(qk). Let x £ Z. Then there exists p £ Y such 
that fi(p) is C(5-close to x. Let y = fj{p) and z = fk(j>)- Then by (14.11) we have 
| Aij(x) — y | < CS, | Ajk{y) — z\ < CS and \A ik (x) — z\ < CS and therefore 

\Ajk(Aij[x)) - A ik (x )| < CS. 

Thus (14.31) holds for every x £ Z. Since Z C Di and Z contains a ball of radius 
it follows that (14.31) holds for all x £ Di. □ 


Lemma 4.5. Let qi, qj, q k £ Xq. Then 

1. If qi and qj are adjacent, then 

\ A ij(Pi)~Pj\ - d x (qi,qj) 

2. If q k is adjacent to both qi and qj, then 


< CS. 


I A ik (pi) - A jk (pj)\ - d x {qi,qj) 


< CS 


Proof. The first assertion follows from the second one by setting k = j (recall that 
Ajj is the identity map). Let us prove the second assertion. 

Since pi = fi(qi), (14.11) implies that A ik {pi) is C<5-close to f k {qi)- Similarly, 
Ajkipj) is C(5-close to f k (qj)- Hence the distance \A ik (pi) — Aj k {pj)\ differs from 
|/fc(<7i) - fk(qj) | by at most CS. In its turn, the distance | fk(qi) ~ fk(qj)\ differs 
from dx(qt,qj) by at most 2 S because fj is a 2S- isometry. Thus \Ai k (pi) — Aj k {pj)\ 
differs from d x {qi,qj) by at most CS and the lemma follows. □ 
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1/3 

Lemma 4.6. For every i £ N and every x £ D j ' there exist j £ N such that qi 
and qj are neighbors and Aij(x ) £ D^ 50 . 

Proof. Since f t is a 25-isometry from Bi(qi) to D i: there exists y £ Bi{q{) C X 
such that | fi(y) — x\ < 25. Since Xq is a jig-net in X , there is a point qj £ Xo 
such that dx(y,qj ) < pjg. For this point qj we have 

\x~ fi(qj )| < \fi{y) - fi(Qj)\ + 25 < d x (y,qj) + 45 < jig +45 

since /j is a 25-isometry. This and the fact that x £ D X J 3 imply that 

\Pi - fi{0j )I < I + IM + 4<5 - 

Since = fi(qi) and is a 25-isometry, it follows that 

dx(qi, qj) < g + iso + 65 < \. 

Thus qi and qj are neighbors, in particular there is a well-defined map Ajj. Since 
Aij is an isometry, we have 

I Aij(x) - Aij(fi(qj))\ = \x- fi(qj )| < ^ + 45. 

By (ThlT) we have \Aij(fi(qj)) - fj(qj)\ < C6, hence 

\Aij{x) - pj\ = \Aij(x) - fj ( qj )| < tm + CS < ^ 

provided that 5 is sufficiently small. Thus A.^ (x) £ O )- / 50 as claimed. □ 


4.2. Approximate Whitney embedding. At this point we essentially forget 
about the original metric space X and use the collection of balls Di C R" and 
maps Ajj from the previous section for the rest of the construction. Let fl = (J Di. 

Let S be the unit sphere in R” +1 centered at e n +i, where ei,...,e„+i is the 
standard basis of R n . Note that S contains the points 0 and 2e„_f_i. For every 
r > 0 we denote by S r the set of points in S lying at distance less than r from the 
‘north pole’ 2e„+i. That is, S r = S fl B r (2e n+ i). 

Fix a smooth map <p: R” — > S with the following properties: 

(1) tp(x) = 0 for all x £ R" \ Bi/ 5 (0). 

(2) <p|.Bi / 5 (o) is a diffeomorphism onto S \ {0}. 

(3) <^|b 1 / 10 (o) is a diffeomorphism onto the spherical cap Si. 

(4) V 5 |s 1 / 50 (o) is a diffeomorphism onto the spherical cap Si/io- 

For each i let pi(x) = g>(x — pf) and define a map Ft: fl —► S C R" +1 as follows. 
If a point x £ fl belongs to a ball Dj, put 


(4.4) 


Fi{x) 


ipi(Aji(x)), 

0 , 


if Dj is adjacent to Di 
otherwise. 


In particular Fj(x) = ipi(x) if x £ Di. 


Lemma 4.7. If Fi(x) ^ 0 for some x £ 


, then qi and qj are neighbors. 


Proof. The assumption Fi(x) ^ 0 implies that qi and qj are adjacent and therefore 
Fi{x) = ipi{Aji{x)). Thus tpi(Aji(: r)) ^ 0 and hence | Aji(x) — Pi\ < g. Since Aji is 
an isometry and | pj — x\ < g, we have 

\Aji(Pj) ~Pi\ < \Pj -x\ + | Aji(x) - pi | < §. 

This and Lemma T4. 51 21 imply that dx(qi,qj) < § + C5 < i, hence qi and qj are 
neighbors. □ 
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Let E be the space of square-summable sequences in R " +1 equipped with 

the norm defined by \u\ 2 = \ u i\ 2 f° r u = ( u i)i*Li- This is a Hilbert space naturally 

isomorphic to £ 2 . Define a map F: fl —> E by 

(4.5) F(x) = (Fiix))^ 

Lemma 14.21 implies that for every x £ U there are only finitely many indices i such 
that Fi(x) ^ 0. Therefore the sequence F(x) £ (R” +1 )°° is finite and hence indeed 
belongs to E. 

Lemma 4.8. 1. F is smooth and moreover 

( 4 - 6 ) \\F\\ c * { n) < C k 

for all k > 0. 

2. For every ieN the restriction F\ d i/ w is uniformly bi-Lipschitz, that is, 

(4.7) C~ x \x - y\ < | F(x) - F(y)\ < C\x - y\ 
for all x, y £ dV 10 . 

Proof. 1. Let x £ Di. By Lemma 14.21 there is at most C indices j such that 
Fj\ n i 0. For every such j we have 11 df r Fj 11 < 1^11 c k (W' 1 ) : therefore |(if 7 1 11 < 
C • ll ( / , |lc fc (K' 1 ) = Cfc. 

2. The second inequality in (14.71) follows from (14.61) . To prove the first one, 
observe that |F(a;) — F(y)\ > \Fi(x) — Fi(y)\ > C~ x \x — y\ since the ith coordinate 
projection from E to R” does not increase distances and Fi\ D i/io = <fi\ D i/io is 
bi-Lipschitz. □ 

Eq. (14.71) implies that the first derivative of F is uniformly bi-Lipschitz, i.e., 

(4.8) C~ x \v | < \d x F(v)\ < C\v\ 
for all x £ D^ 10 and v £ JR". 

Lemma Pi. 8 1 implies that for each i the image Ej := F(D 1 / 10 ) is a smooth sub¬ 
manifold of E. Moreover this submanifold has bounded geometry (e.g., bounded 
curvatures, normal injectivity radius, etc.) We are going to apply Theorem [5] to 
the union E = (J. Ej in E. As the first step, we show that these submanifolds lie 
close to one another. 

Lemma 4.9. Suppose that qi and qj are neighbors and let x £ D\ / 5 . Then Aij(x) £ 
Dj and 

\F(x) — F(Aij(x))\ < CS. 

Moreover, 

(4.9) ||<C(F-FoA ij )|| <C m 6 
for all m > 0. 

Proof. By Lemma T4.51 

| Aij(pi) -pj\ < d x {qi,qj ) + CS < \ + CS. 

Since Aij is an isometry, \Aij(x) — Aij(pf)\ = \x — Pi\ < Therefore 

| Aij(x) -pj\ < | Aij(x) - Aijipi )| + \Aij(pi) -pj\ < | + 4 + CS < 1, 

hence Aij(x) £ Dj. Since x is an arbitrary point of D\/ T> , we have shown that 
•'„.(/>: ■'■) c Dj. 
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Recall that the number of indices k such that F\ does not vanish on Di U Dj 
is bounded by a constant depending only on n. Hence in order to verify (TO! it 
suffices to show that 

(4.10) \\d™{F k ~F k oA l3 )\\<C m 5 

for every fixed k. Consider four cases. 

Case 1: qk is adjacent to both qi and q 3 . In this case 

Fk 1 ^ 1/5 = ipk ° Aik\ d i/5 

and 

Fk ° Aij |^-jl/5 = <fk ° Ajk ° Aij j^l/5 ■ 

Now (14.101) follows follows from the fact that the affine isometries A,k and Ajk o A^ 
are CWclose on Di by Lemma 03) 

Case 2: qk is not adjacent to qi and qj. This case is trivial because Fk\Di and 
Fk o Aij\u i both vanish by definition. 

Case 3: qk is adjacent to qj but not to qi. In this case Fk\D t = 0 by definition. 
Let us show that Fk o Aij\ D i/s also vanishes. Since dx(qk,qi ) > 1, Lemma 03 ] 

implies that \Akj{pk) — Aij (p;)| > 1 — CS. Hence for every y £ Dj ' 5 , 

\A k j(pk) - Aij(y )| > 1 - | - C5 > 

Since Akj = Aj^ and Akj is an isometry, this implies that \pk — Ajk o Aij(y)\ > ^ 
and hence 

F k o Aij(y) = ipk o Aj k o Aij(y) = 0 

for every y £ D X J h . 

Case 4'- qk is adjacent to q-, but not to qj. In this case Fk o A^j^i/5 = 0, so it 
suffices to prove that Fk\ D i/a = 0. Suppose the contrary, then Lemma Pf.71 implies 
that qk and qi are neighbors. Since qi and qj are also neighbors, it follows that qk 
and qj are adjacent. This contradiction proves the claim. □ 

We introduce the following notation for some important subsets of E. For every 
i £ N define 

S i = F{D] /W ) and = F(D\ /5 °). 

Let E = U, £* and E° = Q, E°. 

Recall that E; is a smooth n-dimensional submanifold of E. For a point x £ E^, 
we denote by T x T,i the tangent space of Ej at x realized as an affine subspace of E 
containing x. That is, T x T,i is the n-dimensional affine subspace of E tangent to 
Ei at x. 

Lemma 4.10. For every x £ E^ there exist j £ N and y £ E° such that 

(4.11) \x — y\ < C6 
and 

(4.12) Z(T x T,i,T y T,j) < C6. 

Proof. Since x £ E^, we have x = F(z) for some z £ D l J W . By Lemma |4.6I there 
exists j such that qi and qj are neighbors and A t j(z) £ D^ 50 . Let y = F{Aij(z)), 
then y 6 Ej. Lemma 14.91 for m = 0 implies that 

\x-y\ = \F(z)-F(A ij (z))\<C5 
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M 


Figure 2. A schematic visualisation of the interpolation algo¬ 
rithm ‘ManifoldConstruction’ based on Theorem [TJ see Section [5] 
Assume that a finite metric space ( X , dx ) is given. Then, we con¬ 
struct local coordinate charts D\ C R n approximating the r-balls 
B*(xi) C X in the data space X. We embed these local charts 
to an Euclidean space E = K m using a Whitney-type embeddings 
F( l ) = F\u i : D ^ 10 —> Ej. Surfaces E* C E are denoted by blue 
curves. Using the algorithm Surfacelnterpolation, the union (J i E 
is interpolated to a red surface M C E. When Pm is the normal 
projector onto M, denoted by the red arrows, we can determine 
a metric tensor gi on Pm(E») by pushing forward the Euclidean 
metric from Di to Pm( Ej) by the map Pm oF\ d .. The metric ten¬ 
sor g on M is obtained by computing a smooth weighted average 
of tensors g-i- 


proving (14.111) . To prove (14.121) . observe that T^E, and T y T,j are parallel to the 
images of the derivatives d z F and d^. ( Z )F, resp. The image of d^. ( Z )F coincides 
with the image of d z (F o Ajj ). By Lemma 14.91 for to = 1 we have 

\\d z F — d z (F o Aij)\\ < C6. 

This and (14. (SI) imply (14.121) . □ 

We use general metric space notation for subsets of E. In particular, for a set 
Z C E and r > 0 we denote by U r {Z) the r -neighborhood of Z in E. 

Lemma 4.11. E D f7i/2(E°) C Ucsi^i) for every i £ N. 

Proof. Let q € E D U 1 / 2 (E°). Since q £ L r 1 / 2 (E°), there exists y € D V 50 such that 
|q — F(y) | < Since q £ E, we have q = F{z) where 0 £ Dy 10 for some j. Since 
the zth coordinate projection from E to R n+1 does not increase distances, 

\Fi{z) - F^ < I F(z) - F(y )| = \q- F{y)\ < 

Recall that Fi(y) = ifii(y) because y £ Di. Since y £ D^ 50 , the point g>i(y) belongs 
to the spherical cap S-[ / 10 - Hence | Fi(y) — 2e n+ i| < Therefore 

I Fi(z) - 2e n+ i| < | Fi(z) - Fi(y)\ + \ F,(y) - 2e n+ i| < 5 + 33 < 1 - 

Thus Fi(z) belongs to the spherical cap Si C S C R" +1 , in particular Fi(z) ^ 0. 
Hence Fi{z) = ipi(Aji(z)) and therefore Aji(z) £ (p~ 1 (Si ) = d] 1 ' 10 . 
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Since Fi(z ) ^ 0, Lemma 14.71 implies that qi and qj are neighbors. Now by Lemma 
14.91 (for m = 0) we have 

I q ~ F{Aji(z))\ = \F(z) - F(Aji(z))\ < CS. 

Since Ajj(z) £ D\ 10 , this inequality implies that 

q&Ucs(F(D 1 i /10 )) = Uc S (^i). 

Since q is an arbitrary point from the set £ fl Ui^fY^), the lemma follows. □ 

Lemma 4.12. For every q £Y® and every r > 0, 

d H {Yi (~1 B r {q), T q Yi (~l B r (q )) < Cr 2 . 

Proof. By Lemma 14.81 £; = F^D^/ 10 ) is a surface parametrized by a uniformly 
bi-Lipschitz smooth map F\ D i/w. We may assume that r < 50Co where Co is the 

bi-Lipschitz constant in (14.71) . Let q = F(x) where x £ D\/' M . Then every point 
q' £ £, fl B r {q) is the image of some x' £ B r / C0 (x) C B 1 / 50 (x) C D l ^ w . Hence 

dist (q',T q Y,i) < Cr 2 , 

where C 2 is the uniform bound of the second derivatives of F | D i/io, see (PTHD . This 
means that £,; deviates from its tangent space T q Yi within the r-ball B r (q) by 
distance at most Cr 2 . 

In addition, the point q £ £° = F(D]^ 50 ) is separated by a distance at least 
> 2r from the boundary of Therefore, for each point from T q Y>i fl B r (q) 
there exists a point in E, within distance ^r 2 . □ 

The next lemma essentially says that the £ C E is CWclose to affine spaces in 
E at a scale of order 5 1 / 2 . 

Lemma 4.13. For every x £ £i and every r > C5 1 / 2 , 

dn(Y fi B r (x),T x Yi fl B r (x )) < Cr 2 . 

Proof. By Lemma [4.101 there exists j £ N and q £ £° such that \x — q\ < CS and 
Z(T x Yi,T q Yj) < CS. Let A = T q Yj. Observe that the Hausdorff distance between 
the affine balls T x Yi 0 B r {x) and Bf(q) = A fl B r (q) is bounded by 

\x — q\ + r sin Z(T x Yi, A) < CS + CrS < Cr 2 

since <5 < cr 2 . Hence it suffices to verify that d#(£ fl B r {x), B x (q)) < Cr 2 . By 
the definition of the Hausdorff distance, this is equivalent to the following pair of 
inclusions: 

(4.13) YnB r (x)cU Cr 2 (B^(q)) 
and 

(4.14) B r A ( g )cl/ Cr! (£nB r (i)). 

Since \x — q\ < CS, we have B r (x) C B r+ cs(q) and therefore 

£ n B r {x) c £ n B r+C s{q) C £ D Ur+csiY®) C UcsiYj) 

where the last inclusion follows from Lemma 14.111 Hence 

£ D B r (x) c Ucs{Yj) D B r+ cs{q) C UcsiYj) n B r+2 cs{q)) C U C r 2 iB^ +2 csiq)) 

where the last inclusion follows from Lemma 14.121 and the assumption that 5 <r 2 . 
Since Bf + 2 CS {q) C U 2 csiB^{q)), this implies (14.131) . 
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It remains to verify (14.141) . Since \x — q\ < CS and r > 5 1 ^ 2 , we may assume 
that r > \x — q\. Let r\ = r — \x — q\. By Lemma 14.121 

B Xl (q) c [/ Cr 2(SflB ri (g)) C U Cr !(EnB r (i)). 

Since B^(q) C U r - ri (q)) and r — r 3 < CS < Cr 2 , this implies (14.141) and the 
lemma follows. □ 

4.3. The manifold M. We choose a positive constant r$ < 1 such that 

(4.15) C 0 r 0 < er 0 

where Co is the constant C from Lemma l4.13l and cro is the constant from Theorem [21 
Some additional requirements on ro arise in the course of the argument below, but 
the final value of ro depends only on n. 

We may assume that the constant Sq in Proposition 14.II satisfies So < crl, where 
c = C~ 2 with C being the constant from Lemma [4.131 Then, for S < So, Lemma 
14.131 implies that 

(4.16) df/(£ D B ro (x),T x Ei D B ro (x)) < Crl 

for every a : £ T,i. This and (14.151) imply that the assumptions of Theorem [2] are 
satisfied for £ in place of X, ro in place of r, Crl in place of S , and T x T,i in place of 
A x (for x £ £i). The conclusion of Theorem [2] with these settings is the following 
lemma. 

Lemma 4.14. Let £ satisfy If ro is sufficiently small and S < erg, then 

there exists a closed n-dimensional smooth submanifold M C E such that 

1. d ff (£,M) < Crl < j^r 0 < 

2. The second fundamental form of M at every point is bounded by C. 

3. The normal injectivity radius of M is at least ro/3. 

f. The normal projection Pm'- f/ ro / 3 (M) —^ M is C-Lipschitz and satisfies 
||d™PM|| < C m rl~ m for all m > 2 and x £ U ro /z(M). 

5. Z(T x Y,i, T Pm ( x }M) < Cr 0 for every x £ £,. □ 

In Lemma [4.141 If. the first inequality follows from Theorem [2] and the subse¬ 
quent ones follow from the assumption that ro is sufficiently small. The inequality 
d_y(£,M) < y_r 0 ensures that £ lies ‘deep inside’ the domain of Pm- The last 
assertion of Lemma 14.141 comes from Remark 13.121 

Let M C E be a submanifold from Lemma f4. 141 The fourth assertion of Lemma 
14.141 for to = 2 implies that 

(4.17) | \d x P M ~ dp M ( x) P M || < C dist(a;, M) < Cr 0 

for every x £ U ro /o(M). For x £ M , the map d x PM is the orthogonal projector 
in T X E onto T X M so that ||(4 ;-Pm|| < 1- This and (14.171) yield that ||(4 ;-PmH < 
1 + Cdist(x, M) for x £ U ro / 3 (M). Hence, x >->• Pm{x) is locally Lipschitz in 
x £ U ro / 3 (M) with the Lipschitz constant 1 + Cro- Below, we assume that ro is 
chosen so that this Lipschitz constant satisfies 1 + Cro < 2. 

Recall that the set £ = [J ■ £i = IJ, F{D 1 ^ 10 ) is contained in the domain of Pm- 
For each i, define a map ifi: D 1 J W — > M by 

i>i = Pm° F\ d i/w 

and let Vi be the image of tpi, that is 

Vi = P m {F{d] / w )) = P M (£i). 


Observe that 
(4.18) 


| ifi{x) - F(x )| < cZ/l(£, M) < Crl < jT 
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for every x £ D\ 10 . This follows from Lemma l4.14l l'l and the fact that ipfx) is 
the nearest point in M to F{x). 

The next lemma shows that the maps ipi provide a nice family of coordinate 
charts for M. 

Lemma 4.15. If r$ is sufficiently small and 6 < erg, then 

1. is uniformly bi-Lipschitz, that is, 

C~fx -y\< 1 4>i(x) - iffy) | < C\x - y\ 

for all x,y £ D^ 10 . In particular, V) is an open subset of M and ifi is a diffeo- 
morphism between Dand V,. 

2- U l MD 1 i /30 ) = M. 

3. If i,j £ N are such that V fl Vj then qi and qj are neighbors. 

Proof. 1. Since Pm and F\ D i/w are uniformly Lipschitz, so is their composition ipi. 
It remains to prove that 

(4.19) \ipfx) -ifi(y)\ > C- x \x-y\ 

for all x, y £ D 3 / 10 . Lemma [4.141 51 and (14.171) imply that for every x £ E* the 
restriction of 3 x Pm to T x T,i is Cro-close to a linear isometry between T x T,i and 
Tp M (x)M. This fact and (14.81) imply that there is C = C(n ) such that 

( 4 . 20 ) |d*V’i(«)l><r 1 M 

for all x £ D 1 / 10 and v £ R n . By Lemma I4.141 4L the derivatives of Pm up 
to the second order are bounded by a constant not depending on xq. By m 
it follows that the second derivatives of ifi are uniformly bounded. This, (14.201) 
and a quantitative version of the inverse function theorem imply that (14.191) holds 
whenever the distance \x — y\ is no greater than some constant cq depending only 
on n. 

To handle the case when \x — y\ > Co, observe that 

\M X ) - > \ F ( x ) - F (y )I - Cr l 

by (14.181) . Since ^1^ in uniformly bi-Lipschitz (by Lemma I4~S1) . it follows that 

(4.21) | ffx) - ipi(y)\ > G-fx -y\- Cr 2 0 

for all x,y £ D\' w . If \x — y\ > Cq and r 0 is so small that Cir^ < ^Cf 1 co where C\ 
is the constant C from (|4.21[) . then the right-hand side of (14.211) is bounded below 
by ^Cf x \x — y\. Thus (14.191) holds for all x,y £ D V 10 and the first claim of the 
lemma follows. 

2. Let x £ M. By Lemma 14.14! ll there exists z £ T, such that \x — z\ < Cr g. 
By Lemma T4. 1 01 there exists i £ N and y £ such that \y — z\ < CS. Then 

\x — y\ < Crl +C6 < Crl < r o/3 

where in the last inequality we assume that ro is sufficiently small. We are going 
to show that x £ F(D]^ 30 ). 

Since x £ M and \x — y\ < ro/3, the straight line segment [x,y\ is contained in 
the domain of Pm- Let 7 be the image of this segment under Pm- Then 7 is a 
smooth curve in M connecting x to the point Pm{v) G Pm(S°) = iffD^ 50 ). Since 
Pm is locally 2 -Lipschitz, we have length^) < 2\x — y\ < Crl■ We parametrize 7 by 
[ 0 , 1 ] in such a way that 7 ( 0 ) = Pm(v) and 7 ( 1 ) = x. Suppose that x £ iffD^ 30 ) 
and let 

t 0 = minjf S [ 0 , 1 ] : 7 (t) £ V>i(L>i /3 °)}. 
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This minimum exists since ^(£)V 30 ) is an open subset of M. Define 7 (t) = 
i/>“ 1 ( 7 (t)) for all t £ [0,to)- Note that t 0 > 0 and 7 ( 0 ) £ D ^ 50 because Pm{v) G 
50 ). Since ifi is a diffeomorphism onto its image, 7 is a smooth curve in Di. 
Moreover, since ipi is uniformly bi-Lipschitz, we have 

length^) < Clength( 7 ) < Gr\. 

Hence the limit point p = lim t _ ) . to 7 (t) exists and satisfies 

I P - 7(°)l < length( 7 ) < Cr\. 

We may assume that r 0 is so small that the right-hand side of this inequality is 
smaller than ^ — A,. Since 7 ( 0 ) G D^ 50 , it follows that 2 G D\^ 0 . Hence 
7 (^ 0 ) = 4>i{p) G ipi(D]^ 30 ), contrary to the choice of t Q . This contradiction shows 
that x G ^(-D^ 30 ). Since x is an arbitrary point of M, the second claim of the 
lemma follows. 

3. Assume that V. D Vj ^ 0. Then there exist x G .dV 10 and y G D j ^ 10 such 
that ipi(x) = ipj(y)- This equality and (14.1811 imply that \F(x) — F(y)\ < hence 

(4-22) 1*10* :)-F i (y)\ < l, 

(recall that F r : 0 —> R ” +1 is the ith coordinate projection of F). Since x G D^/ 10 , 
the point Fi(x) G ffi n+1 belongs to the spherical cap Si and therefore |F, 0*01 > !■ 
This and (14.221) imply that Fj (y) ^ 0 and hence y,; and qj are neighbors by Lemma 
15771 □ 

Note that Lemma 14.15T 34 and Lemma 14.21 21 imply that the sets V) cover M 
with bounded multiplicity, that is, for every x G M the number of indices i such 
that x G Vi is bounded by a constant depending only on n. 

Now we can fix the value of ro such that Lemma 14.141 and Lemma 14.151 work. 
Since ro is yet another constant depending only on n, we omit the dependence on 
ro in subsequent estimates and just use the generic notation C . In particular, the 
fourth assertion of Lemma 14.141 now implies that 

(4.23) \\PM\\c m {u ro/ 3 (M)) < C m 
for all to > 0. This and (14.61) imply that 

(4.24) IIV-ilU^AO) < Cm 
for all to > 0 . 

Lemma 4.16. If x G D 1 ^ 10 , y £ D 1 ^ 10 trod ifi{x) = ifj(y), then 

(4.25) |F(x) - F(y)\ < C6. 

Proof. Applying Lemma H.IOI to the point F(x) G £* yields that there exists k G N 
and a point z G D]J 50 such that | F{x) — F(z)\ < CS. Since Pm is uniformly 
Lipschitz, it follows that 

(4.26) | ipi{x) - ipk(z )| < CS 
and (since ifi(x) = ipj{y)) 

(4-27) \ijj j {y)-ifk(z)\<C6. 

This and (14.181) imply that | F{y) — F{z)\ < \ + CS < |, hence F(y) £ ( 7 i/ 2 (£fe). 
By Lemma [4.111 it follows that F(y) £ Uc&ifPk)- This means that there exists 
z' £ d]/ W such that 
(4.28) 


\F(z')-F(y)\<CS. 
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Then 

I Mz') - Mv )I = I Pm(F(z')) - P M {F(y))\ < CS. 
since Pm is uniformly Lipschitz. This and (14.271) imply that \ipk{z) — ipk{z')\ < CS. 
Since ipk is uniformly bi-Lipschitz by the first claim of the lemma 14.151 it follows 
that 

\z - z'\ < C\ipi(z) - ipi(z ')| < CS 

and hence | F{z) — F(z')\ < CS by Lipschitz continuity of F. This and (14.281) imply 
that \F(y) — F(z)\ < CS. 

Thus we have shown that (14.271) implies that | F(y) — F(z)\ < CS. Similarly 
(14.261) implies that |F(x) — F(z)\ < CS and (14.251) follows. □ 

1 /15 

We are going to restrict our coordinate maps ipi to smaller balls D % . Let 
V- = ipi{Dy ib ) and Uij = D V-). The set Uij C D l J lb is the natural 

domain of the transition map ipj 1 o ipi between the restricted coordinate charts. 

Lemma 4.17. Let i,j £ N be such that V[ D Vj ^ 0. Then 

(4.29) \\ipj 1 o ^ - AijWcm^..) < C m 5 
for all m > 0. 

Proof. Note that qi and qj are neighbors by Lemma [4.151 3b By Lemma [4.91 it 
follows that A i j{D 1 J w ) C Dj. Consider the map G: D) ,/10 —> E defined by G = 
F o Aij\ D i/w. By Lemma T4.91 we have 

(4.30) \\G-F\\ Cm(D in 0 ) <CmS. 

This and Lemma l4.14f ll imply that the image of G is contained in the domain 
of Pm, so we can consider a map ipi: D\^ 10 —f M defined by ipi = Pm o G. The 
relations (14.301) and (14.231) imply that 

11 *0* — 0*11 C'm(Z3 1 / 10 ) ^ CmS- 

If 5 is sufficiently small, this and Lemma l4.15f ll imply that ipi is a diffeomorphism 
onto its image, the image of ipi contains V[, and the composition ip~ 1 oip i is CJ-close 
to the identity, more precisely, 

(4.31) IIC 1 oifi- id|| Cm(D i/i5 ) < C m 5. 

Let us show that Aij(Uij ) C D^ 10 . Let x £ Uij and z = Aij(x). Then \F(x) — 
F{z) | < CS by Lemma FTTH Let y £ Uji be such that ipj{y) = ipi{x). Then 
l-F(x) — F(y )| < CS by Lemma [4. 161 Therefore \F(y) — F(^)| < CS. Since F\n } is 
uniformly bi-Lipschitz by Lemma POT 2b it follows that 

\ y -z\<C\F(y)-F(z)\<C 6 <±-± 

if 5 is sufficiently small. Since y £ Uji C D^ 15 , this implies that z £ D^ 10 . 

Thus we have shown that Aij{Uij) C Dj°. This implies that 

i’iluij = Pm ° F o Aij\ Ui} = ipj o Aij\ Vij 

and therefore 

nv; = Arf o V^lvvnv;- 

Now (14.311) implies that 

\\A~j 1 o iff 1 oifi- id||c"*(£/ y ) < C m S. 


and (14.291) follows. 


□ 
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4.4. Riemannian metric and quasi-isometry. Now we are going to equip M 
with a Riemannian metric g such that the resulting Riemannian manifold (M, g) 
satisfies the assertions of Proposition 14.11 (The metric induced from E is not 
suitable for this purpose. One of the reasons is that its curvature is bounded by C 
but not by CS.) 

First we observe that there exists a smooth partition of unity {iq} on M subor¬ 
dinate to the covering (V)'} and such that 


(4.32) ||Uj ° ll( 7 m (D 1 /i 5 ) ^ 

for all i,j £N and all m > 0. To construct such a partition of unity, fix a smooth 
function h : R n —»• R + which equals 1 within the ball B 1/30 (0) and 0 outside the 
ball - 61 / 15 ( 0 ). Then define Ui: M —> R + by 


Ui{ x) 


h(ip i 1 {x) - p^, \ix£V( 

0 , otherwise. 


Finally, let u = ]T) j; tq and Ui = Ui/u. Lemma 14.171 implies that 


\\Uj°1pi\\c m( D 1/15 ) ^ C m 

for all i.j £ N and all m > 0. Since the sets V( cover M with bounded multiplicity, 
it follows from Lemma r4.15f 2l that a similar estimate holds for and (14.3211 

follows. 

For every i £ N, define a Riemannian metric gi on V) by gi = (ip] _1 )*1?-E where gE 
is the standard Euclidean metric in El / ” 1 0 C R" and the star denotes the pull-back 
of the metric by a map. In the other words, gt is the unique Riemannian metric 
on Vi such that ipi is an isometry between D l J W and (Vi,gi). Then Lemma f l. 1 71 
implies that 


(4-33) \\ip*gi - gsWc^iUij) < C m S 

for all to > 0 and i, j £ N such that V( fl V’ ^ 0. Define a metric g on M by 
g = utgi. The pull-back ip*g of this metric by a coordinate map t/jj has the form 

(4.34) ip*g = (Ui o ipj)- ip* gi . 

By (14.321) and (14.331) it follows that 


(4.35) 


Wj9 - 9E < C m S. 


So in the local coordinates defined by ipj on V- the metric tensor is Ci5-close to the 
Euclidean one and its derivatives up to the second order are bounded by CS. So are 
the sectional curvatures of the metric. Thus (M, g) satisfies the second assertion of 
Proposition 14.11 

Let d g : M x M —► R + be the distance induced by g. The estimate (14.3511 implies 
that the coordinate maps ipi are almost isometries between the Euclidean metric 
on Z),- / l5 and the metric g on V[. More precisely, ipi distorts the lengths of tangent 
vectors by a factor of at most 1 + CS. Therefore 


(4.36) 


(i+CT)- 1 <' w ; w ’ , (- (i '» 
F — y\ 


< 1 + cs, 


for all x,y £ Cl / 3<l . (The ball D\ /3 ° here is twice smaller than the domain where 
ipi is almost isometric. This adjustment is needed because the d 3 -distance between 
points in V{ can be realized by paths that leave V{.) 

Now we construct a (1 + CS , Cc>)-quasi-isometry 4/: X —> M. Recall that Xq = 
{9i}iPLi is a i 5 o _ne f i n our original metric space X and for each i £ N we have a 
25-isometry /): B\ {qi) Di such that fi{qi) = Pi. We construct if: X —> M as 
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follows. For every x £ X, pick a point q g £ Xq such that dx{x,qj) < ygg and 
define = ipj(fj{x)). The next lemma shows that the choice of qj does not 
make much difference. 


Lemma 4.18. Let x £ X and qi £ Xq be such that dx{x,qi) < yg . Then fi{x) £ 
D^ 15 and 

(4.37) d s (T(x), ipi(fi{x))) < CS. 


Proof. Let qj be the point of Xq chosen for x in the construction of T. Then 
dx(x,qj) < ygg and 'F(x) = ipj(fj(x)). By the triangle inequality, 

dx(qi,q 3 ) < ^ + iso < i 

hence g,; and qj are neighbors. Observe that \fi(x) — Pi| < gg +C5 since pi = fi{qP) 
and fi is a 25-isometry. Similarly, \fj(x) — Pj\ < ygg + CS. Hence fi{x) £ D]^ 15 
and fj{x) £ D 1 / 50 . By gT]), the point fj(x) is C^-close to dy(/i(a;)), hence T(x) 
is C5-close to ipj(Aij(fi(x))). By Lemma [TUI (for m = 0) and Lipschitz continuity 
of Pm, the latter is C5-close to ipi(fi(x)). Thus 

l'F(x) - ipi(fi(x))\ < CS. 


This implies that 

(4.38) d M (^{x),i>i(fi{x))) < CS 

where cZm is the intrinsic metric of M induced from E. Indeed, the points a = 'F(x) 
and b = ipi(fi(x )) can be connected in M by the Pm -image of the line segment [a, 6 ], 
and the length of this path is bounded by CS. By construction, the metric d g on 
M is bi-Lipschitz equivalent to c?m (with bi-Lipschitz constant depending only on 
n). Hence (14.381) implies (14.371) . □ 


Now let us show that 'F(X) is a C5-net in (M,d g ). Let Y = \J i 'tpi(B 1 / 20 (qi)). 
It follows from Lemma f4. 181 that Y is contained in a Cd-neighborhood of ’F(X) in 
(. M,d g ). Hence it suffices to prove that Y is a (75-net in (M,d g ). Since fi is a 
25-isometry, the set fi(B 1 / 20 (q i )) is a C5-net in the ball j^C 20+cs _ The ^-images 
of these balls cover M by Lemma fd. 151 21. Since each ipi almost preserves the metric 
tensor, it follows that Y, and hence ’F(X), is a C5-net in (M,d g ). 

Lemma 4.19. For all x,y £ X such that dx{x,y) < ygg or d 3 (4'(x), ^t(y)) < ygg, 
one has 


(4.39) \d g (^(x),^>(y)) - d x (x,y)\ < CS. 

Proof. Let x £ X and q 3 be the point of Xo chosen for x in the construction of T, so 
that d x {x, qi) < ygg. ThenT(x) = ipi(fi(x)). Note that \fi(x)-pi\ < ygg+<75 < yg 
since p 3 = f%{qi) and fi is a 25-isometry. (Recall the definitions in Section |4Tj) 
First, we consider the case when y £ X is such that dx{y,qi) < ygg- Since /, is 
a 25-isometry, \fi(y) — Pi\ < ygg + CS < yg and the distance | fi(x) — fi{y)\ differs 
from dx(x, y) by at most 25. The above and (14.361) imply that 

I dg(ipi(fi(x)),ipi(fi(y))) - d x (x,y)\ < CS. 

This and Lemma 14.181 prove (14.391) when dx{y,qf) < y§g- 

In particular, this proves the claim of the lemma in the case when dx(x, y) < ygg 
as then by the triangle inequality we have dx(y, qi) < Ygg + joci < ygo • 

Second, we consider the case when y £ X is such that d 9 ('F(x), T(y)) < ygg. For 
every r > 0, denote by Bi(r) the ball of radius r in M with respect to d g centered 
at ijji(pi). Since ipi almost preserves the metric tensor, we have 

Bi(± - CS) C V[ = MD] /15 ) C Bi(± + CS). 
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Since | fi(x) — Pi\ < y^g + CS, it follows that the point ^(a;) = ipi(fi ( x )) belongs to 

^(tso + cs ) and hence ®(y) e B i(m + m + cs ) = B ^h + cs ) c v l- 

Let qj be the point of Xq chosen for y when defining if that satisfies dx(y , qj) < 
jig. Since \f( y ) £ V[, the point 2 := iff 1 (if (y)) = zfff 1 o ipj(fj(y)) is well-defined. 
Moreover, 2 lies within distance yg + CS from pi since 'f(y) £ i?i(Jg + CS). By 
Lemma f4.17l 2 is C5-close to Aij(fj(y)) and the latter is C5-close to fi(y) by 
(HU). Hence \fi(y) — Pi\ < yg + CS. Since ft is a 25-isometry, it follows that 
dx{y, qi) < -^q + CS < y^g. Thus, (14.391) follows from the first part of the proof. 

□ 

Lemma f4.191 and the fact that \f(Jf) is a CS -net in (M,d g ) imply follows that if 
satisfies the assumptions of Lemma 12.41 with r = y^g and CS in place of <5. Lemma 
12.41 implies that if is a (1 + CS, C5)-quasi-isometry from X to (M, d g ) and the 
first claim of Proposition 14.II follows. The second claim is already proven above. It 
remains to prove the third claim of Proposition l4.ll Since if is a (1 + CS, C5)-quasi- 
isometry, every unit ball in ( M,d g ) is GH C5-close to a unit ball in X and hence 
in R ra . Therefore one can apply Proposition 11.71 with M = R", p = 1, K = CS, and 
(-Bf(O), B ^ 1 (x)) < CS, where x £ M. This yields that inj M > 1 — CS > \. 
This finishes the proof of Proposition 14.11 and the proof of Theorem [T] 

Remark 4.20. The quasi-isometry parameters in Theorem [l] are optimal up to con¬ 
stant factors. To see this, assume that a metric space X is (1 + Sr -1 , 5)-quasi- 
isometric to an n-dimensional manifold M with | SecM | < 5r _3 and | inj M | > 2 r. 
Then by (11.71) the r-balls in X are GH C5-close to r-balls in M. Furthermore, by 
(11.11) the r-balls in M are GH C5-close to r-balls in R". Hence X is C5-close to R" 
at scale r. 

Thus the assumption of Theorem Q] that X is (5-close to R ra at scale r is necessary, 
up to multiplication of the parameters by a constant factor depending on n. The 
assumption that X is (5-intrinsic could be weakened, but it is not really restrictive 
due to Lemma 1231 

Proof of Corollary M First we prove the first inclusion in (11.101) . Let X be a 
metric space from the class Xis/e{n, K/ 2, 2i$, D — S). Then there exists a manifold 
M £ M(n,K/2,2io,D — 5) such that dcH{M,X) < |. Hence every p-ball in X 
is GH |-close to a p-ball in M. Take p = r = (5/LT) 1 / 3 < Iq, then by (11.11) 
we have dcH{B^ 1 (x), H”) < \Kr 3 = | for every x £ M. Hence every r-ball 
in X is GH 5-close to B Thus X is 5-close to R" at scale r. Similarly X is 
5o-close to R n at scale r$. Since dcH^AI, X) < | and the Riemannian manifold 
M is a length space, Lemma 0(1) implies that X is 5-intrinsic. We also have 
diam(X) < diam(M) + 2dcH (X, M) < D. Thus X £ X, proving the first inclusion 
in (11.101) . 

Now let us prove the second inclusion in (11.101) . Let X £ X. Recall that 5 = Kr 3 , 
5o = iftg and 5 < 5o- Therefore r < io and 5r -1 < 5o*g 1 < 02 - If 02 is sufficiently 
small then by Theorem [[] there is a manifold M which is (1 + CSr -1 , C5)-quasi- 
isometric to X and has | SecM | < C5r ~ 3 = CK. Let us show that inj M > io/3. To 
see this, apply Theorem [1] to zq and Sq in place of r and 5. This yields a manifold 
M 0 which is (1 + CSoig 1 , C5o)-quasi-isometric to X and has | SecM 0 | < CSqZq 3 = 
CK and inj Mo > i 0 /2. Since 5 < 5o and 5r _1 < 5o* ( )j" 1 , both M and M 0 are 
(1 + CSqIq 1 , C5o)-quasi-isometric to X. Hence they are (1 + CS 0 Zq , C5o)-quasi- 
isometric to each other. This fact and Proposition II.71 21 imply that 

inj M > (1 - CSoio 1 ) min{inj Mo , } > (1 - CSoiff 1 ) 1 -^ > j 















42 


FEFFERMAN, IVANOV, KURYLEV, LASSAS, NARAYANAN 


provided that tT 2 is sufficiently small. 

By m we have (1gh(X,M) < CSr^D. Therefore diam(M) < D{ 1 + C8r ~ * 1 2 ). 
Let Mi be the result of rescaling M by the factor (1 + C8r~ 1 )~ 1 where C is the con¬ 
stant from the above diameter estimate. Then diam(Mi) < D and <1gh(M, Mi) < 
C8r~ 1 D. Hence 

(4.40) d GH {X,M{) < d GH (X,M) + d GH {M,M 1 ) < CSr^D = CDK 1/3 6 2/3 . 

We may assume that a-i is so small that the above scale factor between M and Mi is 
greater than |. Theninj Mi > |inj M > *o/4 and therefore Mi £ A4(n, CK, *o/4, D). 
This and (14.401) imply the second inclusion in (11.91) and Corollarv ll. 41 follows. □ 

5. Algorithms and proof of Corollary 11.81 

The constructive proofs of Theorems [T] and [2] yield algorithms that can be used 
to produce surfaces or manifolds from finite data sets. We give only the sketches 
of the algorithms. The algorithms use the sub-algorithms FindDisc and GHDist 
given in Sections 12.HI and 12.41 In the description of the algorithm we assume that 
the data set A is finite. 

First we outline the algorithm based on Theorem [2] 

Algorithm Surfacelnterpolation: Assume that we are given the dimension n, the 
scale parameter r, and a finite set points Ac E = R^. We suppose that X is 
(5r-close to n-flats at scale r where 8 is sufficiently small. Our aim is to construct 
a surface M C E that approximates the points of X. We implement the following 
steps: 

(1) We rescale X by the factor 1/r. After this scaling, the problem is reduced 
to the case when r = 1. 

(2) We choose a maximal -separated set Ao C A and enumerate the point 
of Ao as {gi}^. We apply the algorithm FindDisc to every point qi £ Ao 
to find an affine subspace Ai through qi such that the unit n-disc 2bnPi(gi) 
lies within Hausdorff distance CS from the set A C\ Bi(qi). We construct 
the orthogonal projectors Pc E -A E onto A t . 

(3) We construct the functions tpi : E E, defined in (13.51) . that are convex 
combinations of the projector Pj and the identity map. Then we iterate 
these maps to construct f: E —> E, f = ipj o ipj_ r o ... o , see AMD - 

(4) We construct the image M = f(Us( A)) of the ^-neighborhood of the set A 
in the map /, see Remark 13.101 

The output of the algorithm Surfacelnterpolation is the n-dimensional surface 
M CP. 

The algorithm based on Theorem [j] is the following. 

Algorithm ManifoldConstruction: Assume that we are given the dimension n , 

the scale parameter r, and a finite metric space (A, d). Our aim is to construct 
a smooth n-dimensional Riemannian manifold ( M,g ) approximating (A, d). We 
implement the following steps: 

(1) We multiply all distances by 1/r. After this scaling, the problem is reduced 
to the case when r = 1. 

(2) For each a; £ A, we apply the algorithm GHDist to the ball B\{x) C A to 
find the value 8 a (x). Define 8 a = max l£ x 8 a {x). 

Note that, by Lemma l^75l the values S a {x) estimate the Gromov-Hausdorff 
distance between the ball Bi(x) and P™. Thus A is 2<5 a -close to R™ (see 
Definition ED- We require that S a is smaller than the constant 8 0 (n)/2 
given in Proposition l4.ll If this is not valid, we stop the algorithm and give 
the output that the data does not satisfy the needed assumptions. 
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(3) We select a subset a maximal y^-separated set X 0 C X and enumerate the 

points of X 0 as We choose a set {pi}^Li such that the unit balls 

Di = C R" are disjoint. For every q,j £ Xq, we apply the algorithm 

GHDist to find a (5 a -isometry /, : B\{qi) —> Di. 

(4) For all qi, qj £ Xq such that d(qi, qj) < 1, we construct the affine transition 
maps Ajj : R n —y R n , see Lemma T4.31 and Remark 12.61 

(5) Denote D 0 = [jf = 1 D 1 ^ 10 , where D 1 ^ 10 = B l / W {p i ) C R", and E = 
jj(n+i)iv_ We construct a Whitney embedding-type map 

F(x) = (Fi(x))*jL x 

where F t : Oq —>• R ” +1 are given by (|4.4ll . 

( 6 ) We construct the local patches E,; = F(D\' W ) and maximal cro-separated 
subsets {yi,k}k =i °f S*, where oq is the constant from Proposition 11.51 

(7) We apply algorithm Surfacelnterpolation for the points {yiy, 1 < * < 
N, 1 < k < Kj} to obtain a surface M C E. We construct the normal 
projector Pm'- U 2 /s(M) — > M for the surface M. We note that in this 
algorithm Pm can be replaced by the map / constructed in the step 3 of 
the algorithm Surfacelnterpolation , see R.emark l3.131 

( 8 ) We construct maps ifi = Pm 0 F\ D 1/10 : D—> PM^j) C M. 

(9) We construct metric tensors on sets Pm (Si) C M by pushing forward the 
Euclidean metric g e on Do to the sets Pm( Si) using the maps ipi. Then 
metric g on M is constructed by using a partition of unity to compute a 
weighted average of the obtained metric tensors, see (14.341) . 

The output of the algorithm is the surface M C E and the metric g on it. Note 
that the algorithm uses only the distances within r-balls in X , 

Remark 5.1. The manifold M given by the algorithm ManifoldConstruction can be 
represented using local coordinate charts. The algorithm gives sets Ui = O ] /3() C 
R", that can be considered as local coordinate charts of M, the metric tensors 
g(x) on these charts, and the set A f of the pairs (i , j) such that V’i(Pi)n^>j ( Uj) ^ 0 . 
Moreover, the algorithm gives for all (i,j) £ M the relations {(x, x') £ Ui x Uj : 
ipi{x) = ipj(x')} that are the graphs of the transition functions rjji = ipj 1 o ipi that 
map Vij = <p“ 1 ((p i (C/ i ) D to Vji = D <Pj(Uj)). The collection 

of Ui, pW : Ui R" X71 , and jjy : —> Vji , (z,j) £ W is a representation of the 

Riemannian manifold M in local coordinate charts. Using this representation we 
can determine the images of a geodesic 7 a; 0l { 0 (s), emanating from (xo, £ 0 ) £ TM, on 
several coordinate charts Ui and determine the metric tensor in normal coordinates, 
m- Thus, for practical imaging purposes, the algorithm ManifoldConstruction can 
be continued with the following steps 

(10) For given xo £ M, determine the metric tensor g in the normal coordinates 
given by the map exp Xo : {£ £ T Xo M : ||£|| 9 < p}, where p < inj M . 

(11) For given xo £ M and independent vectors £i >£2 £ T Xq M, visualise the 

properties of the metric g, e.g. the determinant of the metric in the normal 
coordinates, using in the set ||si£i + 1|^ < P the map s = (s\,S 2 ) > 

det(g(exp Xo (si£i + S 262 ))), that produces images of two-dimensional slices. 

Finally, we prove Corollary 11.81 

Proof of Corollary 1 1.81 Let us consider S < So, where So = 5oin,K) is chosen 
later in the proof, and r = ( S/K ) 1 / 3 . Then r < tq, where ro = ( Sq/K ) 1 / 3 . By 
(hud, the manifold N is (5-close to R n at scale rj 2 provided that above ro < 
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min{ K 1 / 2 , ^injjy}. Hence the set X with the approximate distance function d 

is C(!>-close to R™ at scale r/2. As in Lemma [2731 we can replace d by a CJ-intrinsic 
metric d' on X. This can be done with standard algorithms for finding shortest 
paths in graphs. By Lemma [2~T1 ( X , d') is (1 + CJr -1 , CJ)-quasi-isometric to N. 

The metric space (X,d') is CoJ-close to R" at scale r/2 for some absolute con¬ 
stant Co- We may assume that <5o = Jo(u,AT) satisfies Jo < K~ 1 / 2 a^ 2 , where 
cti = <Ti(n) is given in Theorem [Q Then Jo < orro- 

As in the above algorithm ManifoldConstruction , using the given data one can 
construct a manifold M = (M,g) which is (1 + CJr -1 , CJ)-quasi-isometric to X 
and has | SecM | < C\K. Since both M and N are quasi-isometric to X with these 
parameters, they are (l + CJr _1 , CJ)-quasi-isometric to each other. By Proposition 
ll.bl it follows that there exists a bi-Lipschitz diffeomorphism between M and N with 
bi-Lipschitz constant l + CJr -1 = 1 + CK 1 / 3 8 2 / 3 . Thus M satisfies the statements 
1 and 2 of Corollary B.81 

To verify the last statement of Corollarv ll.81 assume that Jo = 8 a(n, K) is chosen 
to be so small that ro = (Jo/A') 1 / 3 < (C/A') -1 / 2 . Then Proposition II. 71 21 applies 
to M and M = N with CS in place of J and C\K in place of K. It implies that 

inj M > (1 - CJr _1 )min{inj w -,7r(C , iif) _1/2 }. 

We may assume that Jo is so small that the term 1 — CJr -1 = 1 — C A' 1 / 3 J 2 / 3 in 
this estimate is greater than 4. Then the last statement of Corollarv ll.81 follows. 
Choosing Jo = Jo (n, K) so that the above conditions for Jo and ro are satisfied, we 
obtain Corollarv ll.81 □ 


6. Appendix: Curvature and injectivity radius 

The main goal of this appendix is to prove Proposition 11.71 We begin with 
recalling some facts about Riemannian manifolds of bounded curvature and proving 
the estimate (ED 

Let M = ( M , g) be a complete Riemannian manifold with | SecM | < AT where 
AT > 0. For p £ M, consider the exponential map exp p : T p M —> M. We restrict 
this map to the ball of radius r < tt/Vk in T p M centered at the origin. As 
a consequence of Rauch Comparison Theorem, exp p is non-degenerate in this ball 
and we have the following estimates on its local bi-Lipschitz constants: for y £ T p M 
such that \y\ = r < n/VK and every £ £ T p M \ {0}, 


( 6 . 1 ) 


sin (\fKr) |dj / exp p (^)| sinh(-\/RV) 


< 


l£l 


< 


\J~Kr 


\J~Kr 

(see e.g. [BT] Thm. 27 in Ch. 6] and [Mj Thm. IV.2.5 and Remark IV.2.6]). 

If r < and r < | inj M (p) then the geodesic r-ball B ^ 1 (p) is convex, i.e., 

minimizing geodesics with endpoints in this ball do not leave it (see e.g. [611 Thm. 
29 in Ch. 6]). This makes the local bi-Lipschitz estimate (16.11) global: 

sm/i/AV) dM(exp p (y),exp p (z)) sinh(v / Ar) 


< 


< 


( 6 - 2 ) _ 

V Kr | y - z\t p m \JKr 

for all y,z £ T p M such that 

(6.3) max{|y|, \z\} < r < min{^, \ inj M (p)}. 

Since sint >t — ^t 3 and sinht < t + jt 3 for0<t<7r/2, (16.21) implies that 


(6.4) 


|d M (exp x (i/),exp ( 2 )) - \y - z\t p m\ < \Kr 3 
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for all y, z £ T p M satisfying & This means that the restriction of exp p to the 
r-ball in T p M is a (^A> 3 )-isometry onto B^ 1 {p) and (11.11) follows. 

In the sequel we will need Toponogov’s Comparison Theorem (see e.g. [fiT i Thm. 
79 in Ch. 11]), which can be formulated as follows. Let Mf K denote the rescaled 
hyperbolic plane of curvature —K. For real numbers a, b > 0 and a £ [0, 7 r], denote 
by Y -K{a,b,a) the length of the side X 1 X 2 of a triangle AxqX\X2 in Mf K such 
that the sides x 3 x\ and XqX 2 equal a and b and the angle at xq equals a. Note 
that Y_x(a, &, a) is monotone in a: if a' > a , then Y_/f(a, b, a') > Y -x(a,b, a). 
Toponogov’s Theorem asserts that, if 71 and 72 are minimizing geodesics in M 
connecting po £ M to p± and P 2 , resp., with length^) = a, length^) = b and 
^( 711 Y 2 ) = a, then d M (pi,P 2 ) < Y- K {a,b,a). 

The following lemma is the key part of the proof of Proposition 11.71 

Lemma 6.1. There exists 03 = a 3 (n) > 0 such that the following holds. Let M 
and M be complete n-dimensional Riemannian manifolds with | Secjvf | < K and 
I Sec| < K, where K > 0, and 

0 < r < min{^, inj^(S)}. 

Let x € M, x £ M, and assume that 

d G H(B^(x),B^(x)) < S < a 3 r. 

Then inj M (x) > r — 20 S. 

Proof. We fix a metric on the disjoint union of the balls Bf 4 (x) and B^ 1 (x) realizing 
the GH distance between them. We say that points y £ B^(x) and y £ B^ 4 (x) are 
GH approximations of each other if the distance between them in the metric on the 
union is less than 6 . By the definition of the GH distance, every point in one ball 
has at least one GH approximation in the other ball. Since we are working with 
pointed GH distance, the centers x and x are GH approximations of each other. 

The statement of the lemma is scale invariant so we may assume that r = 7 r and 
hence K < 1. Let r 3 = inj M (x) and suppose that 

(6.5) 20 6 <r 0 <r- 205. 

Since r 3 < r < ^=, Klingenberg’s Lemma (see e.g. |61i Lemma 16 in Ch. 5]) implies 
that there exists a geodesic loop 7 of length 2r 0 in M starting and ending at x. 
Let y be the midpoint of this loop and 71 , 72 the two halves of 7 between x and y. 
Note that 71 and 72 are minimizing geodesics and dM(x,y) = ro- 
Let y £ B^(x) be a GH approximation of y. Then 

d^(x, y) < d M {x, y) + 25 < r - 18(5. 

Since inj j^(x) > r, it follows that there is a point z £ B^ 1 (x) such that dj^(y,z) = 
185 and y belongs to the minimizing geodesic from x to z. Let z £ B^(x) be a GH 
approximation of z and let a = dM(y, z). Since the triangle inequality in M turns 
to equality for x, y , z and / is a 5-isometry, we have 

to + a = d M {x, y) + d M (y, z) < d M (x, z) + 65, 

or, equivalently 

( 6 . 6 ) dhi{x, z) > ro + a — 65. 

Also note that 

(6.7) |a - 185| = \d M (y, z) - d^(y, z)\ < 25. 

Let 73 be a minimizing geodesic between y and z. Consider the angles ^( 73 , 71 ) 
and Z( 73 , 72 ) at y. Their sum equals it, hence at least one of them is no greater 
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than Assuming w.l.o.g. that Z( 73 , 71 ) < we apply Toponogov’s comparison 
to the hinge of 73 and 71 . This yields 

( 6 . 8 ) d M {x,z) < T - K (a,r 0 , f) = Y_i(a,r 0 , f) 

(recall that K = 1). Let us show that 

(6.9) T-i (a, r 0 , f) < r 0 + 

provided that S is sufficiently small. Since a < ro by ( 1531 ) and UEZD, we have 

(6.10) T_i (a, r 0 , f) < T_i(a, a, f) + r 0 - a 

by the triangle inequality in the hyperbolic plane. By rescaling, 

Y_i(a,a, = a • Y_ a 2 (l, 1 , ^) ~ a\Pl, a —> 0 . 

The asymptotic equality here follows from the fact that the rescaled hyperbolic 
plane converges to R 2 as the curvature goes to 0. Since y/2 < |, it follows that 
Y_i(a, a, ]}) < |a if a is sufficiently small. This and (16.101) implies (16.91) . 

Now (ESI and (16JSJ) imply that 

cIm(x, z) < ro + \a = ro + a — \a < r 3 + a — 85 

where the last inequality follows from (16.71) . This contradicts (16.61) . therefore the 
assumption ( 1531 ) was false. 

Thus we have either ro > r — 205 or ro < 205. In the former case the assertion 
of the proposition holds, so it remains to rule out the case when r 0 < 205. 

Suppose that the proposition is false. Then there exists a sequence Si —> 0 
and complete Riemannian manifolds Mi, Mi with points 27 £ Mi and 27 £ Mi 
satisfying the assumptions of the proposition with r = 7 r, K = 1, 5 = Si and 
such that inj M . (27) < 205^. Due to uniformly bounded curvature, the sequences 
{(Mi, 27)} and {(Mi, a;*)} are pre-compact in the pointed GH topology, see e.g. 
El Ch. 10, Corollary 31(2)]. Passing to a subsequence if necessary we may assume 
that (Mi, 27) and (Mi, 27) converge to pointed metric spaces (A', 2:) and (X,x). The 
limit spaces X and X are Alexandrov spaces of curvature > —K. See [25] Ch. 10] 
or El for basics of Alexandrov space geometry. We are going to use the fact that 
Alexandrov spaces with curvature bounded below are dimensionally homogeneous. 
Furthermore an Alexandrov space X contains an open dense subset which is a 
Lipschitz manifold whose dimension equals the Hausdorff dimension of X. 

Since (27), B (27)) < Si —> 0, the balls B*(x) and B^(x) in the limit 

spaces are isometric. Since inj M . ( 27 ) —> 0, the sequence {Mi} collapses, therefore 
the dimension of Ao is strictly less than n. This follows from e.g. HI . Theorem 0.9], 
see also El an d [S Ch. 8 ]. On the other hand, since inj ry (x t ) is bounded away 

from zero, the limit space X is an n-dimensional manifold, see e.g. [H § 8 D], Due to 
dimensional homogeneity of Alexandrov spaces, it follows that dim^(i?{ f (x)) = n 
and dim h(B^(x)) < n. Hence these balls are not isometric, a contradiction. This 
finishes the proof of Lemma 16.11 □ 

Proof of Proposition E3 i. Define C = max{20, fJ 3 : } where (73 is the constant 
from Lemma [5711 Let d = dcH(B^ 1 (x), B^ 1 (x)). If d < a 3 p then (11.111) follows 
from Lemma 1531 bv setting r = p. Otherwise ( 11 . 111 ) holds for the trivial reason that 
its right-hand side is nonpositive. This proves claim (i). 

2. Let p = min{injjjr(S), ^ 7 =}. Since M and M are (1 + 5r _ 1 ,5)-quasi-isometric 

and r < p, dl) implies that for every x £ M there exists x £ M such that 

dcHiBp 1 (x), Bf (x)) < C5r~ 1 p. 
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Hence by the first part of the proposition we have 

inj M (x) > p- C5r~ 1 p = (1 - CSr~ 1 )p. 

Since x is an arbitrary point of M, (|1.13D follows. □ 
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